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These are the lecture notes for quantum and statistical mechanics courses that are given by DC at 
Ben-Gurion University. They are complementary to Lecture Notes in Quantum Mechanics [arXiv: 
quant-ph/0605180]. Some additional topics are covered, including: introduction to master equations; 
non-equilibrium processes; fluctuation theorems; linear response theory; adiabatic transport; the 
Kubo formalism; and the scattering approach to mesoscopics. This is a first draft. Typos are likely. 
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The canonical formalism 
[1] The statistical picture of Mechanics 

= [1.1] Random variables 



Here is a list of topics that should be covered by a course in probability theory: 
Random variable/observation x 
Distribution function p(x) 
for discrete spectrum p(x) = Prob (x = x) 
for continuous spectrum p(x)dx = Prob (x < x < x + dx) 
Changing variables y — f (x), p (y) dy — p{x)dx 

Expectation values (x) = ~^2 x p{x)x, and more generally (A) = ~^2 x p(x)A(x) 

Variance Var(f) = ((x — (x)) ) = (x 2 ) — (x) 2 
Moment generating function Z(X) = (c Xx ) 

Comulant generating function 17(A) is defined through Z{\) = exp[<7(A)] 
Gaussian distribution p(x) ex exp[i((a; — p)/a) 2 ], 5(A) = pX + \cr 2 \ 2 
Joint Distribution function of two variables p (x, y) 
Correlation function C xy — (xy) 

Adding independent random variables: 

S = x + y (1) 

(S) = (£) + (y) (2) 

Var(5) = Var(x) + Var(y) (3) 

g s (\) = g x {\) + g y (\) (4) 

• Central limit theorem. 

• Remark on quantum postulates [see arXiv: quant-ph/0605180] 

= [1.2] Statistical description of a classical particle 

The statistical state of a classical particle is described by a probability function: 

p( x jP)~^ — — PROB (x < x < x + dx,p < p < p + dp) (5) 

where the normalization is 
f f dxdp 



JJ^ 



p(x,p) = 1 (6) 



Expectation values of observables are defined in the usual way: 

(A) = JJ*^p{x,p) A(x,p) (7) 



We note that in the quantum context one can define a quasi distribution that corresponds to p(x,p), known as 
the Wigner function. Furthermore with any observable A we can associate a phase apace function A(x,p) such 
that the expectation value can be calculated using classical look-alike formulas. This is known as the Wigner- Weyl 
formalism. This formalism can be regraded as generalization of WKB: Roughly speaking one may say that each 
Planck cell in phase space can be regarded as representing a quantum state. The volume of Planck cell is (2irH) d 
where d is the number of freedoms. Above we have assumed d = 1. Note that the normalization convention allows 
a sloppy interpretation of p (x,p) as the probability to occupy a Planck cell in phase space, we also remark that the 
quantum requirement trace(p 2 ) < 1 implies that a wavepacket in space space cannot occupy a volume that is less than 
a Planck cell. The probability function of x is 

P(x) = Jt^P&p) (8) 

The " spreading" of a wavepacket is characterize by 

ol = Var(x) = ((x-(x)) 2 ) = (x 2 ) - (£) 2 (9) 

ol = Var(p) = ((p-(p)f) = (P 2 )-(p) 2 (10) 

In the quantum context a x a y > (ft/2). The "energy" of the system is defined as follows: 

E = (H(x,p)) = JJ ^ p(x,p) H(x,p) (11) 

Later we shall define some other "spectral" functions that arc related to "H. Those can be written as an expectation 
value of functions of % . 



[1.3] Dynamics in phase space 



The Hamiltonian of a free particle is 
P 2 

n = L (12) 

Say that at t = the particle is at (xo,Po)- The equations of motion arc 

dn p 

x = — = - (13) 

op m 

The solution is: 

x(t) = x Q H po (15) 

p(t) = p Q (16) 

There are two points of view with regard to the evolution: (i) Heisenberg: the evolution is of random variables; (ii) 
Schrodingcr: the evolution is of a probability functions. In statistical mechanics we prefer the latter point of view. 
Evolution takes place in phase space. Liouville theorem applies. Let us see how we use the "Schrodinger picture" 
in the above example. The free particle has been prepared in a "deterministic preparation" at the point (xq,Po) in 
phase space. Accordingly 

Pt=o(x,p) = 2wd(p-po) 6(x-x ) (17) 



After time t the state is 



p t (x, p) = 2nS {p-po) S (x - ( x + — t) ) 



(18) 



Above pt (x,p) is the probability function of the random variables 



x t = x H pb 

m 

Pt = Po 



(19) 
(20) 



If the preparation is not deterministic one observes "spreading" 



<?At) = \K(o) + 



a p (0) 



gg(0) t 



(21) 



It should be clear that "spreading" is a classical dynamic effect. In quantum mechanics this effect is unavoidable 
because deterministic preparations are excluded. 

PA 



|[ 



More generally we can discuss the spreading of a wavepacket in the case of a non-linear oscillator. The Hamiltonian 

is 



n 



v 

2m 



V{x) 



(22) 



The energy V mia < E < oo is a constant of motion. The spreading leads to a quasi-ergodic distribution within an 
energy shell. It is not really ergodic because if we start with a mono-energetic distribution it will not fill uniformly 
the energy surface (which is a "circle"). 

V(x) 





[1.4] The route to ergodicity 



This section will be expanded in future version. It outlines some major observation with regard to the dynamics of 
classical Hamiltonian systems. 

Simple ID system:— The student is expected to be familiar with the dynamics of harmonic oscillator; potential 
well; pendulum. In the case of non-linear oscillations we have the spreading effect. In the case of a pendulum we 
have a multi-component phase space with separatrix. The dynamics is not chaotic. One can define the oscillation 
frequency ui(E) as a function of energy. In the quantum case ui(E) corresponds to the level spacing at the vicinity of 
the energy E. 

Chaotic system:— The student is expected to be familiar with the dynamics in simple billiards. The visualization 
can be achieved using a Poincare section. In the case of a Sinai billiard (motivated by the discussion of Lorentz gas) 
the dynamics is fully chaotic, leading to crgodization. More generally we might have mixed phase space that contains 
"chaotic sea" as well as "islands". 

Ergodization:— The evolution of a chaotic system leads to an ergodization on the energy shell. This can be 
mathematically described using the Boltzamnn approach: course graining of phase space by dividing it into cells; 
definition of Boltzamnn entropy. Eventually the system will become stationary-like, as if it were prepared in a state 
that has maximum entropy. 

Driven system:— There is a complicated route to chaos in the case of driven integrable (ID) systems. In contrast 
to that in the case of driven globally chaotic systems the picture is qualitatively simple: if we prepare the system 
initially within an energy shell, it will "evolve" with this energy shell, along with diffusion transverse to the energy 
shell. This diffusion leads in general to increase of the average energy (heating). 



Spreading 
free particle 



consider a mono-energetic preparation of wavepacket in a billiard 





Spreading, non-linear oscillator 



Poincare section 





& 



energy surface 



no spreading for mono-energetic distribution 
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[1.5] Stationary states 



The evolution of a statistical state is determined by the Lioville equation of classical mechanics, which becomes the 
von-Neumann Lioville equation in quantum mechanics. 



dp 

dt 



[H,p] 



(23) 



We consider non-driven bounded systems, and focus our attention on stationary states that do not change in time. 
This means dp/dt = 0. In the classical language p can be regarded as a mixture of "energy shells", while in the 
quantum language it means that p t— > diag{p„} is a mixture of energy eigenstates. In particular the microcanonical 

state is 



p(x,p) = -—6(U(x,p)-£) 

g(£) 



(24) 



and the canonical state is 

p(x,p) = zWf~ m{x ' p) (25) 

where the density of states and the partition function arc defined as 

g(E) = trace(5(E-H)) = J^5(E - E n ) (26) 

n 

Z(p) = traceCe-^) = J^e""*" = f g{E)dE e~ &E (27) 

n •> 

It is convenient to write the probability density to find the system at some particular energy E as 

p{E) = jjd^ p{xp)5{En{xp)) ^ g{E )f(E) (28) 

where the respective occupation probability functions are 

f(E) = Jgr6(E-£) [microcanonical] (29) 

f( E ) = zWJ c ~ PE [canonical] (30) 

If we have a many body system of non-interacting participles we can re-interpret f(E) as the occupation function, 
and accordingly p(E) becomes the energy distribution of the particles (with normalization N) . 

^^^^= [1.6] The microcanonical and canonical states 

Let us assume the following total Hamiltonian for a universe that consists of system and environment: 

fttot.1 = n(Q) + n cm (Q a ) + n int (Q,Q a ) (31) 

For sake of presentation we do not write the conjugate momenta, so Q stands for (Q, P) or it may represent spin 
freedoms. If one neglect the interaction the eigenenergies are written as E r R = E r + -Er, where r labels system states 
and R labels environmental states. 

It is argued that the weak interaction with the environment leads after relaxation to a canonical state which is 
determined by the parameter 

/3=^log(g onv (£)) (32) 

where g onv (-E) is the density of states, which is assumed to be huge and rapidly growing with energy. The argument 
is based on the assumption that the universe (system+environment) is (say) a closed system with some total energy 
£" total . After ergodization the system get into a stationary- like state that resembles a microcanonical states: 

Pr,R « s(^E to ^-(E r + E R )^ (33) 

with finite width (broadened) delta function. The probability p r to find the system in a state E r is proportional to 

g,UE t ,^-E r ) »5„(£L)e*. Accordingly 

Pr = V<^ (34) 



where the so-called partition function provides the normalization 

Z((3) = J2^ E " (35) 

r 

The partition function may depend on parameters that appear in the system Hamiltonian. Therefore we use in general 
the notation Z(/3,X). 

= [1.7] Mathematical digression 



Sometimes is is more appropriate to expand the log of a function. Specifically this would be the case if the function 

is definite positive and span many decades. Let us see what is the error which is involved in such an expansion: 

f{x) = x N (36) 

/ (x + 6x) =x N + Nx N - 1 Sx + N 2 x N - 2 8x 2 (37) 

Sx < x/N (38) 

Optionally we expand the log of the function: 

S(x) = ]nf(x) = Nln(x) (39) 

S (x + Sx)=N ln(x) + — Sx + %5x 2 (40) 

X X 2 

Sx < x (41) 
Thus we have the recipe: 

f(x + Sx) w f(x)eP Sx (42) 



(43) 



(44) 
where j3 is the log derivative of the DOS. 



where 








P = 


d\nf(x) 
dx 




In 


complete 


! analogy we have: 




g(E 


+ e) w g 


(E )e^ 



[2] Spectral functions 



Various types of spectral functions are defined in mathematical physics. In the quantum context they characterize 
the spectrum {E n } of energies of as given Hamiltonian %. In the continuum or classical limit it is essential to define 
a measure. Blow we focus on the most popular spectral functions in statistical mechanics: the density of states g{E), 
and the partition function Z(/3). We shall see later that the state equations of a system in equilibrium can be derived 
from, say, the partition function. Hence the spectral function serves as a generating function. 

In the section below we define g(E) and Z(/3), and show how they are calculated suing standard examples: Two 
level system; Harmonic oscillator; Particle in a box; Particle with general dispersion relation; The effect of A{x), V{x) 
potential; Several particles; Identical classical particles, the Gibbs factor; Particles with interactions; In particular 
two quantum particles; Molecules of type AA and AB (exercise) . 

^^^^= [2.1] The definition of counting and partition functions 



We consider a time independent bounded system which is described by a Hamiltonian T-L whose eigenvalues are E r . 
We can characterize its energy spectrum by the functions 



N{E) = £0(£-£ r ) = £ 1 (45) 

r E r <E 

Z((3) = J2 e ~ PEr ( 46 ) 



If we have a large system we can smooth N(E) , and then we can define the density of states as 

m - ^ = E*(£-^) (47) 



Note that 

Z(fi) = fg(E)dEe-P E (48) 

For a classical particle in ID we can use the above definitions with the prescription 

Each "Planck cell" in phase space represents a state. Accordingly 

mE) = [[*^ e{ E-H{z,p)) = // ^ (50) 



27r " J Ju(x,p)<E 27r 

Z(.i) = jl^e-m**) = j g {E)dEe^ E (51) 

In what follows the Gaussian integral is useful: 

e-l^dx = (f)" (52) 



= [2.2] Two level system or spin 

The Hamiltonian of spin 1/2 in magnetic field is 

n = l^s (53) 

The eigenstates are |+) and |— ) with eigenvalues E± = ±e/2. Accordingly 

Z{p) = e - /3 (-f)+c- /3 (f) = 2coshf -efi\ (54) 

Optionally we can write the energies of any two level system as E, r = en with n — 0, 1 then 

Z{/3) = (l + e-e<) (55) 



= [2.3] Two spins system in interaction 

If we have N is interacting spins the sum over states can be factorized and we simply get 

Z N (f3) = (Z^p)) 1 " (56) 



For two spins in the absence of magnetic field we get Z2 = 2 2 = 4. Let us see what happens if there is an interaction: 
n = ea a -a b = (25 2 - 3) e (57) 

where S — \o a + \<r '. The energy levels are E S i ng i et — — 3e and E tr i P iet — £■ Accordingly 

Z{p) = e 3fie + 3e- pe (58) 



[2.4] Harmonic oscillator 



The Hamiltonian of Harmonic oscillator is 

n = ^- + -moj 2 x 2 (59) 

zm 2 

The eigenstates are \n) with eigenvalues E n — (j + n) u>. Accordingly 

00 ^ 

Z(/3) = J2^ {h+n)u = 9 ■ wi ^ ( 6 °) 

^0 2sinh( 5 w/3) 

Note that if we write the energies as E r — wn with n = 0, 1, 3, 4, ... then 

Z W = 13^ ( 61 ) 
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Now let us see how the classical calculation is done. 



M(E) = — ellipse area = — ir ( =) (2mE) * = - (62) 

2-k 2tt \ mijj z J u 

m - /*.-<»-/g.-* - (^)W " = <-> 

One can verify the validity of WKB quantization. 



2.5] Particle in a ID box 



The simplest is to assume periodic boundary conditions 

n = f- xe[0,L] (ring) (64) 

The eigenstates are the momentum states \p) with 

2ir 
p = -f-n where n = 0, ±1, ±2... (65) 

Hence the eigenvalues are 
1 /2tt n ' 



E n = -{ T n) 

The number of states up to energy E is 

N{E) = 2^(2m£)5 = -k E L = 2-— (67) 

Z7T 7T Aft 

The density of states is 

g{E) = — (68) 

TTVe 

The ID case here is pathological because in general the density of states grows rapidly with energy. Nevertheless in 
the limit of " infinite volume" we may treat the spectrum as a continuum: 

Let us see how the calculation is carried out classically. We can still consider a ring, or optionally we can write the 
Hamiltonian with a box potential Vl(x). Then we get 

M{E) = rectangle area = — xLx2{2mE)^ = 2— (70) 

2tt \e 
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One can verify the validity of WKB quantization. 

= [2.6] A particle in 3D box, or higher dimensions 

Consider a particle in a d—3 box of volume V = L d . 

3 2 

% = 2_\ 7T~ + [implicit boundary conditions with volume L d ] (72) 

The eigenstates are 

P = -^(«i,n 2 ,n 3 ) (73) 

E ni n 2 n 3 = 7- l-r K + n l + n D ( 74 ) 



2m V L , 
The summation over the states factorizes: 

3(0) = £ e"^™ = [^e-^] = ^ (75) 

The above calculation is easily generalized to d > 1 dimensions. Furthermore, as far as the classical calculation is 
concerned N particles is formally like d t— > Nd. The counting function is 

and accordingly 

g (E) = "- (feLJ^A ex i^/ 2 )" 1 (77) 

(2TT) a Ve 

Note that for (ksL) ~ can be interpreted as the number of modes. Note also that for d = 2 the DOS is independent 
of energy and reflects the mass of the particle. 

= [2.7] Classical particle in magnetic field 

For a particle in an arbitrary scalar potential V(r) in 3D we get 

ZW) . /**.-« . (i)*/*.-™ (78) 

Let us include also a vector potential: 

«=^(p-eA(aO) 2 + V(r) (79) 

2m 
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Z 



dr d P -p[&( P -eA(x))*+v(x)] = f drdp' -^( P f+y(x)] 



(2nY 



(27T) 



(80) 



The result does not depend on A{x). The energy spectrum is not affected from the existence of A(x). The energies 
are E = (l/2)mv 2 + V(x) irrespective of A(x). To explain that the implicit assumption is having relaxation processes 
that makes the dynamics irrelevant. The classical diamagnetic effect disappears in the presence of relaxation. To 
discuss the effect of quantization. 



[2.8] Gas of classical particles in a box 



Let us consider N particles: 



U 



N 

E 



2m 



+ V(r a ) 



+ U(ri,...,r N ) 



(81) 



In the absence of interaction the partition function is 

r 1 



Z N (/3) = (Z x {p)) 



\rp 



i A r 



d 3 x e-WM 



(82) 



From now on we assume gas of identical particles and therefore include the Gibbs factor: 



Z N (f3) * ^Z N ((3) 



(83) 



For N interacting particles we get 



ZnW) 



1 ( 1 



(±Y f d ri ...dr N e-M*—™) 



(84) 



[2.9] Two quantum identical particles 



Let us see what is the partition function for a system that consists of two identical particles, say in a box. The total 
energy is written as E a b = E a + Ef,. The partition function is 



Z(P) 



l Ec -,(^) + (l) E 



a=£b 



Ue- 



-fi(2E a 



-B(E„+E h 



±E 



-2/3E a 



1 r 

2 



Z 1 (pf±Z 1 (2P) 



(85) 
(86) 



For a particle in a d dimensional box 



Zi = 



L 
At 



Z 2 = \ (Z\ ± 2- d l 2 Z 1 



(87) 
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Note that for d — 3 we get 

1 



Z 2 ((3) = ^Z 2 x 



1± 



1 / A3 



2 3 / 2 y volume 
The Fermi case is similar to hard sphere: 

1 



(89) 



MP) 



'-Z{ x 



1 



sphere volume \ 
box volume I 



(90) 



= [2.10] Two quantum particles in a box with interaction 

The calculation of the partition function Z% for two identical quantum particle in a box, is both interesting and later 
on useful for the purpose of calculating the second virial coefficient of an N particle gas. The Hamiltonian is: 



P 2 p 2 
U = — + l - + V(r) 
4m m 



(91) 



In order to be able to do the calculation using separation of variables we cheat with the boundary conditions as 
follows: The center of mass motion is confined to a box of volume V = (47r/3)i? 3 , and the relative motion is confined 
by \r\ < R independently. Accordingly the partition function is factorizes as follows: 



Z 2 = 2 3 / 2 



V 

A',: 






-0E n 



= 2 3 / 2 



V 

A',. 



£■ 



-PE b 



L b 



l(k)dk e -W m ) fe2 



(92) 



where (n, £, m) are the good quantum numbers for the relative motion. Ignoring the possibility of spin, the sum is 
over even or odd values of £, for Bosons or Fermions respectively. In the second equality we separate the bond states 
from the scattering (continuum) states. In order to determine the DOS of the latter wc recall that the radial wave 
functions are phase shifted spherical Bcssel functions. Accordingly the box quantization condition for the allowed k n 
values is 



kR 1 + Se = nn 

2 



(93) 



From here one deduce a formula for the effect of the phase shifts on the DOS: 



m 



(0) 



(k) 



l -t(2i + l 



dde 

dk 



(94) 



Using this result we get after integration by parts the following expression for the interaction effect on the partition 
function: 



Z 9 



,(()) 



2 3/2 



V 

A'-i. 



^e-^+^E 



b 



7T 



kdk 6 t (k) e -^/ m ) fe2 



(95) 



V(x) 
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[3] The canonical formalism 

= [3.1] The energy equation of state 



Consider some system, for example particles that are confined in a box. The Hamiltonian is 

H = H(r,p;X) (96) 

where X is some control parameter, for example the length of the box. Assuming that we are dealsing with a stationary 
state, the energy of the system is 

E = (H) = trace(Hp) = ^p r E r (97) 

r 

If the system is prepared in a canonical states, then it is a mixture of energy eigenstates with probabilities 

Pr = ^-^ (98) 

where the partition function is 

Z((3,X) = Y, e ^ E " (") 

r 

One observes that the energy of a system that is prepared in a canonical state can be derived from the partition 
function as follows: 

Id dlnZ , s 

E = < w > = -zl^ z ^ —BIT (100) 

Also one can find expressions for the higher moments, for example 

<" 3 > - \hh z lm) 

In particular one deduces the relation 

Var(£) = (^)-(nr = §p(~z) = T^ = T*C (102) 

where in the latter equality we have defined the temperature a.s T — 1//3 and the heat capacity as C = dE/dT. The 
notion of temperature will be discussed further below. 

= [3.2] The Equipartition theorem 

In the classical context the Hamiltonian might be a sum of quadratic terms 

u = Y. c d ( 103 ) 

3 
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where qj are either coordinates of conjugate momenta. The partition function factorizes, where each quadratic term 
contributes a ex T 1 / 2 term. It follows that each quadratic term contributes T/2 to the energy, and hence 1/2 to the 
heat capacity. 

This observation can be applied to the analysis of "balls connected by springs". We can always go to normal 
coordinates. The center of mass degree of freedom contributes T/2 to the energy, while each vibrational mode 
contributes T. 

A formal extension of this so-called " Equipartition Theorem" is as follows: 

<C) = ^ ^ 

The proof is as follows: The measure of integration over phase space can be written as dqidqjdq' , where q' represents 
all the other coordinates. Applying integration by parts we have 



/' 



dqdqjdq 1 q t ??M e -m<i) = _ * [dqidqjdq' q % -?- U~ m ^\ = 8 %3 \ f dqtdqjdq' 'e~ m ^ (105) 



d p j ^<~v~* *. Qq 



'!.) 



ft 



and form here follows the Equipartition Theorem. This generalized version is useful in discussing particles that have 
interaction u(xi — Xj) ex \x% — Xj\ a , which constitutes a generalization of the harmonic (a — 2) case. 



[3.3] Heat capacity 



From the Equipartition Theorem one deduce that the heat capacity of an "ideal" system equals to the effective number 
of freedoms: Each independent quadratic term in the Hamiltonian contributes 1/2 to the heat capacity. This simple 
prescription should be refined for two reasons: (i) Degrees of freedom can "freeze" in the quantum treatment; (ii) In 
general a many body system is not ideal due to interactions. We first discuss the quantum issue referring to spins 
and oscillators. 

Spin and oscillator.— For spin (+) or oscillator (-) with level spacing w we have 

ln(Z(/3)) = ±ln(l±e- /3 ") (106) 

„ d In Z u> , 

E = —er = e^r ( 107 ) 

d,E e w / T /uj\ 2 

c ^ ^ § ^ 5^±Ip(?) < 108 » 

In both case the low temperature behavior of C is identical, namely, for T <C u) it is dominated by the Boltzmann 
factor e~P u . At high temperature C of the spin drop down because energy reaches saturation, while C of the oscillator 
approached unity reflecting the classical prediction E w T '. Since E = ton it is more illuminating to re- write the above 
results as follows: 

' /(«) (109) 



w e^±l 

Var(n) - [1 ± /(«)]/(«) (110) 

where f(ui) is known as the occupation function. In the case of an oscillator the result for the number variane can be 
regarded as a sum of a shot-noise particle-like term Var(n) = (n), and a classical term Var(n) = (n) 2 . In the case of a 
spin the fluctuations go to zero in both the "empty" and "full" occupation limits. It is customary in quantum-optics 
to characterize the fluctuations by g^ = ((n 2 ) — (n))/(n) 2 and to say that the thermal result g^ — 2 corresponds 
to bounching, while the "spin" result g^ = corresponds to anti-bounching. The value g^ = 1 reflects Poisson 
statistics and would apply in the case of coherent state preparation. 
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Debye model.— Let us refer further to a system that can be described as consisting of many harmonic freedoms, e.g. 
modes of vibrations. The spectral density of the modes might be oc a;" -1 . For example in Debay model a = d = 3, 
with some cutoff frequency ui c . Then we get for the heat capacity 



C(T) = const 



e "/T 
(c"V T - l) 2 



du! = const T 



a F( 



T 



(111) 



where 



F(y) 



-l+a 



(c*-l) 2 ' 



dx 



(112) 



The quantum result can be described as emerging from "freezing" of freedoms due to the quantization of energy. This 
phenomena has lead to the birth of quantum mechanics in the context of Black body radiation (Planck law) . 

Phase transitions.— We shall discuss phase transitions in later lectures. As the temperature is lowered towards 
a critical temperature T c the system becomes "correlated", which means that the effective number of freedoms is 
reduced. We assume T > T c and note that similar picture applies if one approaches T c from below. We can easily 
explain why the heat capacity diverges as T c is approached. For an ideal gas the partition function is Z = g N where 
N is the number of independent particles, and g = (L/\x) d oc T d//2 is the number of accessible states for a single 
particle that has a temperature T. For a correlated system Z = g N °", where JV eff = N/£ d is the effective number of 
independent regions, and £ is called the correlation length. The prototype Ising model consist of spins (g = 2) and 
£ oc \T — T c \~' / where v m 1/2. Either we can write the expression for the heat capacity as follows: 



C(T) = 



dlnZ 



d 2 lnZ 



d(lnT) d(lnT) 2 



= C 3 (T)+Q(T) 



(113) 



The non-singular term C g (T), that for an ideal gas equals Nd/2, originates from the dependence of g on the tem- 
perature, and is proportional to the effective number of freedoms. The singular term C^(T), that for Ising system is 
proportional to \T — T c \ vd ~ 2 , originates from the dependence of £ on the temperature. 
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[3.4] The Virial theorem 



Somewhat related to the equipartition theorem, is the Virial theorem. It is used to relate the expectation value of 
the "kinetic" and "potential" terms in Hamiltonian of the type H = K(p) + U(r). 

Consider any observable G. It is clear that if the system is prepared in a stationary (not necessarily canonical) state, 
then the expectation value (G) is constant in time. By the rate of change equation of motion it follows that 



[n,G] 



(114) 



In particular let us consider the generator of dilations 



(115) 



We get 



dK\ 

dp J 



dU 

Or 



(116) 



with implicit summation over j. For quadratic K(p) and U(r) the first term equals 2(K), and the second equals 
—2(U), and both equal T/2 if the classical equipartition theorem applies. More generally, for two-body interaction 
term of the type 



u ( r ) = J2 u ( r *~ r ^ = 



(ij) 



(117) 



the second term in the Virial theorem equals —a(U). This is a meaningful statement for a > 0, otherwise there should 
be a "box" that confines the particles. See later discussion of how the "pressure" gets into the Virial theorem. 



[3.5] Generalized forces 



Assume that X is a parameter that appears in the Hamiltonian. We define the generalized force T which is associated 
with the parameter X as 



7 



ax 



(118) 



This definition will be motivated later on when we discuss the notion of work. Here are some examples for generalized 
forces: 



parameter 


generalized force 


piston displacement in cylinder with gas 


Newtonian force 


volume of a box with gas 


Newtonian pressure 


length of a polymer 


Newtonian force (tension) 


homogeneous electric field 


total polarization 


homogeneous magnetic field 


total magnetization 


magnetic flux through a ring 


electric current 



The last item deserves some more explanation. First we note that the current in a ring (times the area) is like 
magnetization. The direct identification of (J 7 ) as the current is based on the following argumentation: If we make a 
change dX of the flux during a time dt, then the electro motive force (EMF) is —dX/dt, leading to a current 1 in the 
ring. The energy increase of the ring is the EMF times the charge, namely dE = (—dX/dt) x (Idt) = —IdX. 
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Given X and assuming that the system is prepared in a canonical state characterized by some /3, we can derive the 
average value y of the generalized force T from the partition function as follows: 



[3.6] Pressure on walls 



Possibly the simplest point of view about pressure is to regard it as arising from collisions of particles with the walls. 
This is the so called the kinetic picture point of view. However, within the framework of the canonical formalism 
the pressure is defined as the generalized force that is associated with the volume, such that ctW = PaW. It is quite 
puzzling that in the formal classical calculation the kinetic part factors out and the mass of the particles does not 
appear in the result: 

3N 
ln(Z(/3,V)) = — — ln/3 + TVlnV + const (120) 

E - -°-w - > <-) 

With interactions we have to calculated a complicated configuration {dridr2-.-dr^) integral. This calculation will be 
discussed in later sections. In the absence of interactions we see that the pressure is related to the kinetic energy, 
namely P = (2/3)£'/V. Below we generalize this relation using the Virial theorem: we shall sec that quite generally, 
both classically and quantum mechanically, the pressure is related to the kinetic and potential energy of the gas. 

The volume deformation of a box is represented by a deformation field D(r). To be specific let us write the Hamiltonian 
of N gas particles in a box as follows: 

H = K(p) + U(r) + V L (r-\D(r)) (123) 

Here K (p) is the kinetic term, and U(r) are the interactions, and Vj,(r) is box defining potential, and A = V — Vo is 
the deformation parameter. We want A to equal the extra volume due to the deformation, and therefore normalize 
the displacement field such that 

D ■ ds = 1, standard choice: D(r) = — r (124) 

o v 

Accordingly the definition and the expression for the pressure are 



P 



_dH\ _ J_/ 5Vl\ _ J_ 17 dK\ I dU 



<9V / 3V \ dr J 3V 



dp J \ dr 



(125) 



where in the last equality we have used the Virial theorem. Note that this extends the discussion of the Virial theorem 
in previous section. The case of inter-atomic interactions with a > (bonded system with no walls) can be regarded 
formally as a special case of the above relation with P = 0. If a < there is non-zero pressure. We can use the 
equipartition theorem to get in the classical case 



>-h 



' 


1 / dU 


NT- 


- ~ ( r- -7T- 




3 \ dr 



(126) 



where the first term is the same as in the law of ideal gases, while the second is due to the interactions, and can be 
expressed using moments of the inter-particle separation. 
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[3.7] Tension of polymer 



The calculation of a tension of a polymer is very similar to the calculation of pressure. The parameter in the 
Hamiltonian is the length L of the polymer, which is analogous to the length or the volume of the box that contains 
the gas particles. In both cases the formal result depends only on the configuration integral, while the kinetic term 
factors out. Thus in both cases the result does not depend on the mass of the gas particles or on the mass of the 
monomers from which the polymer is composed. The partition function in the case of a polymer is 

Z(0, L) = [kinetic term] x £ 5(L - (a* + x 2 + ... + x N )) ^^(configuration) (m) 

conf. 

Typically, in analogy with the case of hard spheres, the potential energy merely restricts the space of allowed con- 
figurations. We see that up to a factor Z(f3,L) is the probability to observe length L. According to the central 
limit theorem for long polymer Z(f3,L) ex exp(— (l/2)(L/Lo) 2 ), where Lq is the RMS average length of the polymer. 
Accordingly we get for the tension F = (T/Lq)L as for a spring. 

Optionally, if we insist on calculating directly the partition sum, it is simpler to imagine that a given tension F is 
applied on the polymer. The associated partition function is 

Z G {fi,F) = [kinetic term] x ^ e -/3[c/(configuration)-F.(x 1 +a :2+ ... + x N )] (128) 

conf. 

If the potential energy of the polymer can be written as a sum u(x\) + u{x2) + u(xn) of independent terms, then 
Zq(/3, F) can be factorized, reflecting that the total length of the polymer for a given tension is the sum of lengths 
of the monomers for the same tension. If required we can come back to Z(/3, L) by an inverse transform. Note that 
from mathematical point of view Z(L) and Zq(F) are the probability function and its associated moment generating 
function, and we simply reconstruct here the procedure which is used to derive the central limit theorem. 



[3.8] Polarization 



The polarization is the generalized force that is associated with electric field. Let us assume that we have a bonded 
system of particles with an added uniform electric field: 

„2 

% = 7 — - 5 — + interactions + potential — y^ q a £x a (129) 

a a 

p = ~ = $>«*<» ( 13 °) 

a 

The polarization P is the expectation value of P. 

= [3.9] Magnetization 

The magnetization is the generalized force that is associated with magnetic field. It is either due to having spin degree 
of freedom (Pauli) or due to the orbital motion. Here we clarify the definition using the three simplest examples. 

Pauli magnetism.— Consider a collection of N spins. We denote the magnetic filed by h. The Hamiltonian is 

N 

n = -Y,9<*hS« (131) 



a = l 

N 






dh 

Q = l 
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The magnetization M is the expectation value of M. 



Orbital magnetism (ID).— Consider a spinlcss particle in a ring of length L, and area A. The magnetic flux is 
$ = hA. The Hamiltonian is 



U = ^(p-4) 2 + V( X ) (133) 

v = i[U,x] = -{P- e TJ ( 134 ) 

dH e , , 

1 = -M = r < 135 > 

M = -_ = A7 (136) 

The magnetization M is the expectation value of M, or optionally we can refer to the circulating current 7, which is 
the expectation value of 7. 

Orbital magnetism (2D).— The more interesting case is the magnetization of electrons in a 2D box (3rd dimension 
does not play a role) due to the formation of Landau levels. We recall again that classically the energy spectrum of 
the system is not affected by magnetic field. But quantum mechanically Landau levels are formed (see "Lecture notes 
in Quantum mechanics" ) . Let us consider a box of area A that contains N spinless electrons. In the bulk, the energy 
of a Landau state that belongs to the v level is e v = (v + (1/2))wb where ljb = eB/m is the cyclotron frequency. The 
degeneracy of each Landau level is gs — eBA/2ir. The calculation of the single particle partition function is the same 
as that of harmonic oscillator (multiplied by the degeneracy). Assuming N electrons that can be treated as an ideal 
Boltzmann gas we get 

M = -£(A) 9 * +0(B 3) (137) 

This result does not hold for a low temperature electron gas, because the Fermi statistics of the occupation becomes 
important. Assuming zero temperature we define B n with n = 1,2,3, ... as the threshold value for which n Landau 
levels are fully filled. This values are determined by the equation rigs = N. Considering first strong field B > B\, the 
energy of the system is Eq = Nwb/2 and hence 

This result has a simple interpretation using " Bohr picture" of an orbiting electron: each electron performs a minimum 
energy cyclotron motion with unit angular momentum L, and associated magnetic moment — (e/2m)L. If the magnetic 
field is £>„+i < B < B n , one has to sum the energy of the electrons in n filled Landau levels, where the upper one is 
only partially filled. One obtain a quadratic expression from which it follows that the magnetization grows linearly 
from — iV(e/2m) to +iV(e/2m). Hence there is saw-tooth dependence of M on the field, which is known as the de 
Haas van Alphen (dHvA) oscillations. 

There is a simple way to understand the dHvA result. For this purpose assume that A looks like a circle. Each 
"Landau state" occupies a strip that has a finite width. Within each strip there is a diamagnctic cyclotron motion 
whose net effect is like having an inner anticlockwise current (7^ > 0), and an outer clockwise current (I (J < 0). The 
total current of the strip is determined by the potential gradient (Hall effect): it is zero in the bulk, and positive along 
the edges. The bulk has diamagnetic contributions to the magnetization, because I (J encloses a larger area compared 
with 7^. When B cross B n we get a jump in the magnetization that corresponds to the occupation of the edge states: 
The hall conductance of n Landau levels is Gh = (e/2ir)n, residing in a region that experiences a potential difference 
ujb- Hence the drop in the magnetization is (Gh^b) xA= N(e/m). 
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[3.10] Empirical temperature 



In practice we would like to be able to probe the j3 of the environment. For this purpose we use a thermometer. The 
simplest thermometer would be an ideal gas in a box, for which The partition function is 

Z(ftV) = V»(^)' (139) 

The empirical temperature is defined as follows: 

PV 1 

9 = — = - (141) 

N fi V ' 

We can of course define different type of thermometers. The idea is simply to identify a measurable quantity that 
reflects the parameter fi. 

= [3.11] Absolute temperature and entropy 

If we formally vary X and /3 as parameters, then the canonical energy changes: 

d ^ E \ dX 
dX r 



dE = J2 d Pr E r + Y,Pr dE r = ( J2 ^ Er ) ^ + ( J2 



- ydX (142) 



The absolute temperature is defined as the integration factor of the first term 

T = integration factor = — (143) 

P 

such that 

dE = TdS-ydX (144) 

The implied definition of the thermodynamic entropy is 

S = -^2p r lnp r (145) 

Note that the thermodynamic entropy is an extensive quantity in the thermodynamic limit. It should not be confused 
with other types of "entropy". In particular we shall discuss the "Boltzmann entropy" in a later section with regard 
to the 2nd law of thermodynamics. 
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[4] Thermodynamics 

= [4.1] The Helmholtz function 



At this stage it is convenient to define the Helmholtz generating function 

F(T,X) = ~]nZ(f3;X) (146) 

which allows to write the state equations in an elegant way: 
dF dF 



[4.2] Optional thermodynamic functions 



State functions that describe physical systems can be derived from generating functions. The generating functions 
should be expressed using their canonical variables. Following formal mathematical procedures we can obtain from 
the Helmholtz function optional generating functions. The most popular are 

F (T, A) , dF = -SalT - ydX (148) 

G{T,y) = F + yX, dG^-SdT + Xdy (149) 

E (S, X) = F + TS, dE = TdS - ydX (150) 

S(E,X), dS=^dE+^dX (151) 

The derivatives of the state equations are know as the "thermodynamic constants" though they are not really con- 
stant... 

C = T— (152) 

dy_ 
OX 

In the context of gases 



x = U tm 



Compressibility = — — (154) 

ExpansionCocff = — — (155) 

oT 



= [4.3] The Grand Hamiltonian approach 

It is customary in thermodynamics to define "thermodynamic potentials" that are obtained from the Helmholtz free 
energy by means of Legendre transform. This can be regarded as a formal mathematical trick for switching the role of 
conjugate variables, but it also can be motivated physically. It is the same procedure that we had discussed regarding 
the calculation of the tension of a polymer. Here we repeat it with regard to a gas in a box with piston. 

Let us regard the position of the piston (the parameter X) as a dynamical variable (let us call it x). We can apply 
force, say "electric" field £ on the piston. Accordingly the "grand Hamiltonian" of the system is 

%G = H + £x + [optional kinetic term] (156) 
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The optional kinetic term is required if the piston has finite mass, but its inclusion will not affect the analysis because 
it factors out of the calculation. The partition function is 



z G W,£) 



E 

x,r 

E cx p 



HE, 



= E 


exp 


G[x;T,£}~ 




T 





F(T,x) + £x 
T 

,-G(T,£)/T 



(157) 

(158) 



where G(T,£) = G[x;T,£], and the most probable value x is obtained by minimizing G[x;T,£]. This means that x 
should be determined by solving the equation £ = y{x). Accordingly we realize that G(T, y) is the Lcgendrc transform 
of F(T,X). The roles of the conjugate variable X and y have been switched. If X and y are the volume V and the 
pressure P, then G(T, P) is known as the Gibbs function. 



[4.4] The chemical potential 



Consider a gas that consists of N identical particles. This can be either classical or quantum gas (contrary to prevailing 
misconception, quantum mechanics is irrelevant to this issue - this will be explained in the "chemical equilibrium" 
lecture). Within the framework of the canonical formalism we define the chemical potential as follows: 



H(T,V,N) 



dF 
dN 



(159) 



Accordingly we have 



dF = -SdT - PdM + fidN 
dG = -SdT + \JdP + iidN 



(160) 
(161) 



The above definition of the chemical potential can be motivated by adopting a " grand Hamiltonian" perspective. Let 
us define a "grand system" that consists of the system and of a reservoir of particles. This reservoir consists of a 
huge collection of sites that hold a huge number of particles with binding energy \i. If we transfer N particle from 
the reservoir to the system the energy of the "grand system" becomes 



H G = H - nN 



(162) 



The so called grand partition function Zg((3,Ij) of the Grand system will be discussed in future lecture. 



[4.5] The extensive property 



At this stage it is appropriate to remark on a relation between the Gibbs function and the chemical potential that 
holds is the so-called thermodynamic limit. In this limit the system acquires an extensive property that can be 
formulated mathematically Relating to the Gibbs function G(T, P; N), one observes that if N is multiplied by some 
factor, then the volume V and the entropy S for the same (T, P) are expected to be multiplied by the same factor, 
and hence also G should be multiplied by the same factor. We therefore write 



G(T,P,N) = N G(T,P,1) 



(163) 



From /i = —dG/dN we deduce that the chemical potential is merely the Gibbs energy per particle. Consequently 
from the expression for dG it follows that 



dft 



S_ 
"AT 



dT 



V 



-dP 



(164) 
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[4.6] Work 



In the definition of work the system and the environment are regarded as one driven closed unit. If we change X in 
time then from the "rate of change formula" we have the following exact expression: 

dE /&H\ 



it follows that 

W = work done on the system = E ti „ al — E initM = — / (T ) t dX (166) 

This is an exact expression. Note that {F)t is calculated for the time dependent (evolving) state of the system. In 
a quasi-static adiabatic process one replaces (J 7 )* by {J-)x{t)-> where the notation (.F)x implies that the system is 
assumed to be in a canonical state at any moment. More generally, within he frame work of linear response theory 

{Pit « Wx -r)X = y{X)-r)X (167) 

The first terms is the conservative force, which is a function of X alone. The subscript implies that the expectation 
value is taken with respect to the instantaneous adiabatic state. The second term is the leading correction to the 
adiabatic approximation. It is the "friction" force which is proportional to the rate of the driving. The net conservative 
work is zero for a closed cycle while the "friction" leads to irreversible dissipation of energy with a rate 

VV irreversible - V X 2 (168) 

More generally it is customary to write 

W = -W + W lrrovorslblo (169) 

where the first term is the conservative work, or so to say "the work which is done by the system" 

W = f(T) x dX = f y{X) dX (170) 

J Jx A 

The two main examples that illustrate the above discussion are: 

Example 1: 

X = position of a wall element (or scatterer) 

X = wall (or scatterer) velocity 

(J 7 ) = Newtonian force 

— rjX = friction law 

rjX 2 = rate of heating 



Example 2: 



X = magnetic flux through the ring 

—X = electro motive force 

(J 7 ) = electrical current 

— r\X = Ohm law 

rjX 2 = Joule law 



2o 




x(t) 




[4.7] Heat 



In order to understand which type of statements can be extracted form the canonical formalism we have to discuss 
carefully the physics of work and heat. We distinguish between the system and the environment and write the 
Hamiltonian in the form 



Wtot.1 = K{r, p; X{t)) + H,„ t + K„ 



(171) 



It is implicit that the interaction term is extremely small so it can be ignored in the calculation of the total energy. 
The environment is characterized by its temperature. More generally we assume that the environment consists of 
several "baths" that each has different temperature, and that the couplings to the baths can be switched on and off. 
Below we consider a process in which both the initial and the final states are equilibrated with a single bath. This 
requires that at the end of the driving process there is an extra waiting period that allows this equilibration. It is 
implied that both the initial and the final states of the system are canonical. Now we define 



W = work = (jH total ) B - {H tatal ) A 

Q = hcat = -((K„v)b-(K.„v)a 
E^ - £ lnitial = (H) b -(H)a = Q 



W 



(172) 

(173) 
(174) 



It is important to emphasize that the definition of work is the same as in the previous section, because we regard 
"H total as describing an isolated driven system. However, E is redefined as the energy of the system only, and therefore 
we have the additional term Q in the last equation. 



[4.8] Quasi static process 



In general we have the formal identity: 
dE = y dp r E r + y p r dE r 



(175) 



We would like to argue that in the adiabatic limit we can identify the first term as the heat dQ and the second term 
is the work —dW. One possible scenario is having no driving. Still we have control over the temperature of the 
environment. Assuming a quasi-static process we have 



dX = 

dE = y. dp r E r = TdS 

dQ = TdS 

dW = 

Q = [E(B) E(A)] 

W = 



(176) 
(177) 
(178) 
(179) 
(180) 
(181) 
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A second possible scenario is having an isolated systems going through an adiabatic process: 

dp r = (182) 

dE = J2Pr dE r = -ydX (183) 

r 

dQ = (184) 

dW = ydX (185) 

2 = (186) 

W = -\E(B) - E(A)] (187) 

Any general quasi-static process can be constructed from small steps as above, leading to 

Q = f TdS (188) 

J A 

W = ( y(X)dX (189) 

J A 

In particular for isothermal process we get 

Q = Tx [S(B) - S{A)} (190) 

W = ~[F(B) - F(A)] (191) 

If a process is both isothermal (constant T) and isobaric (constant P) we can sill get work being done by changing 
some other parameter X, then we get 

Q = Tx [S(B) - S(A)} (192) 

W = -[G(B)-G{A)] (193) 



— = [4.9] Cycles 

It is possible to design cycles in (X, T) space, such that the net effect is to convert heat into work (engine) or in reverse 
(heat pump). Consider for example a gas in a cylinder with a piston. If there is no restriction on the availability of 
baths the simplest engine could work as follows: Allow the gas to expand at high temperature; Lower the temperature; 
Compress the gas back by moving the piston back to its initial position; Raise back the temperature. The net effect 
here is to convert heat into work. The disadvantage of this cycle is that in order to make it reversible we need infinitely 
many intermediate baths in the heating and cooling stages. If we want to use a reversible cycle that uses two baths 
only we need the Carnot cycle. Then we can operate it also in reverse to get a heat pump. 

Assuming that the levels become more dense as X is increased, it follows that the result of an adiabatic process 
would be a lower temperature (adiabatic cooling). If the process is isothermal rather than adiabatic there will be 
heat absorption (isothermal absorption) due to the re-distribution of the probabilities. These "rules of thumb" allow 
to gain intuition with regard to the operation of engines and heat-pumps. 

The simplest system to consider is spin S 3> 1 in magnetic field. In order to be consistent with the piston example 
we define X = —\h\, so larger X is like larger volume, i.e. higher density of states. We consider a cycle that consists 
of 4 stages: adiabatic cooling to lower temperature; isothermal absorption stage (Q\ > 0); adiabtic heating to higher 
temperature; and isothermal emission stage (Q2 < 0). The net effect is is to transfer heat from the cold bath to the 
hot bath, which requires to invest work. 

At each stage the work W is positive or negative depending on whether the occupied levels go down or up respectively. 
The inclination is to say that during the adiabatic cooling stage the work is positive. This is true in the piston example, 
but not in general, as can be verified with the spin example. It should be clear that doing work on the system does not 
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imply that the its temperature becomes higher: the simplest demonstration would be to take an isolated container 
with gas to the top of Eifel Tower: it requires work but the temperature is not affected. What is essential for the 
operation of the cycle is the variation in the density of the levels, irrespective of how they go up and down during the 
cycle. 
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[5] Chemical equilibrium and the Grand Canonical state 
= [5.1] The Gibbs prescription 



In this lecture we are going to discuss chemical equilibrium. We shall see that the condition of chemical equilibrium 
involves the chemical potentials of the participating gases. For the purpose of calculating fi it is essential to find how 
the partition function depends on the number of particles. Classically the calculation of Zf^ for a gas of identical 
particles is done using the Gibbs prescription: 

Zm [Gibbs] = — Z^v [distinguishable particles] (194) 

We shall motivate this prescription in the following discussion of chemical equilibrium. For an ideal gas we get 
1 „v , V v^ _*„. , V 



>iV 



-Z?, where Z 1 = ^ c -^ u „ d = (195) 



Nr 1 ' ,,iiviv ~ y - \\^ - au A| 



The summation is over the non-translational freedoms of the particle. Hence we get 

»--%- Hi) - -+H&) (l96) 

where in the last equality we have assumed that the particle has a single well defined binding energy. The inverse 
relation is 

N = Z X C^ = ^ e -(eo-M)/T (197) 

The notion of identical particles:— The notion of identical particles does not require extra explanations if they 
are indistinguishable as in the quantum mechanical theory. Still we can ask what would happen if our world were 
classical. The answer is that in a classical reality one still has to maintain the Gibbs prescription if one wants to 
formulate a convenient theory for Chemical equilibrium. Namely, the condition for "chemical equilibrium" that we 
derive below has a simple form if we adopt the Gibbs prescription. Without the Gibbs prescription one would be 
forced to formulate an equivalent but non-friendly version for this condition. 

= [5.2] Chemical equilibrium 

Consider the following prototype problem of chemical equilibrium: 

A[a] ^ A[b] (198) 

where "a" and "b" are two phases, say to be in one of two regions in space, or to be either in the bulk or on the 
boundary of some bounded region. Given N identical particles we characterize the macroscopic occupation of the two 
phases by a reaction coordinate n, such that N — n particles are in phase [a] and n particles are in phase [b]. The 
partition function is 

The combinatorial " mixing" factor in the curly brackets counts the number of possibilities to divide N particles into 
two groups. It should be excluded if the particles are indistinguishable, as in the quantum theory. In the classical 



29 

theory, where the particles arc distinguishable it should be included, but it can be absorbed into the definition of the 
partition function. This is what we call the "Gibbs prescription" . Using the Gibbs prescription the above sum can 
be re- written as follows: 

Zt = T, Z N-n Z n (200) 

n 

The probability to have (N—n, n) occupation is proportional to the nth term in the partition sum: 

f ATI 1 7 a 7 6 7 a 7 b 

»<"> = {iiv^i} X ^r - - J ^JT ' °«»H' (*-<"-> + **<»»] (201) 

One should appreciate the usefulness of the Gibbs prescription. It is thanks to this prescription that the Free Energy 
is additive. If we did not use the Gibbs prescription we would be compelled to add in F a term that reflects "mixing 
entropy". The most probable value n is determined by looking for the largest term. This leads to the Chemical 
equilibrium condition: 

F a (N-n) + F b (n) = minimum (202) 

~> -fj, a (N - n) + u 6 (n) = (203) 

Let us consider the case of ideal gases. Using the expression for [i we get 

" - ZX ^ n = N^r (204) 



N-n Zf Zf + Z\ 

This example is simple enough to allow a determination of the average value (n) too. The probability distribution of 
the reaction coordinate is 

A" (Zf)"-" (Z h X 

"«"» - («-„).„. L WW <205) 

leading to 

(n) = ^2p(n) n = n (206) 

n 

We see that the expectation value of n coincides with its typical (most probable) value. In the more general case of 
chemical equilibrium, as discussed below, this is an approximation that becomes valid for N 3> 1 in accordance with 
the central limit theorem. 

= [5.3] The law of mass action 



This procedure is easily generalized. Consider for example 

2C ^ hA + 3B (207) 

Given that initially there are N A particles of type A, 7V B particles of type B, and Nc particles of type C we define a 
macroscopic reaction coordinate n such that Nc—2n is the number of particles of type C, and N A +5n is the number 
of particles of type A, and N B +3n is the number of particles of type B. Accordingly 

go, c _ ^ Z C Nc _ 2n Z% A+5n Z jv B +3n (208) 
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and 

p(n) = COIiste -^(N c -2n) + F«(N A+5 n) + F\N B+ 3n)) (20g) 

leading to the equation 

- 2/j, c (N c -2n) + 5(i a (N A +5n) +3/j, b (N B +3n) = (210) 

which with Boltzmann/Gibbs approximation becomes 

(JV A +5n) 5 (JV B +3n) 3 = (Zjf {Z\f 

(N c -2n) 2 (Zf)2 l ' 

or, assuming that [a],[b],[c] are all volume phases, 

( N A +5n \ 5 ( N B +3n \ 3 

{ v ' V Y ; = k(T) (212) 

where the reaction rate constant n(T) ex e e ' T depends on the reaction energy e = 2e c — 5e a — i£b- 



[5.4] Equilibrium in pair creation reaction 



Consider the reaction 

7 + 7 — c+ + e" (213) 

This can be analyzed like a chemical reaction C r=± ^4 + B, which is of the same type as considered in the previous 
version. The important point to notice is that Z c is independent of n, and therefore the chemical potential of the 
electromagnetic field is formally /j, c = 0. The electromagnetic field is like a "bath", so if we regard it as part of the 
environment, so we could have written vacuum ^ e + + e~. In any case we get at equilibrium 

M e+ (ni) + f{n 2 ) = (214) 

where in the Boltzmann/Gibbs approximation 

M (n) « mc 2 +Tln(^h (215) 

leading to 

„ in2 =^) 2 e* ! /T ( 2i 6 ) 

This problem is formally the same as that of a semiconductor where e + and c~ arc the holes and the electrons, and 
2mc 2 corresponds to the energy gap between the valance and the conduction bands. 
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[5.5] Equilibrium in liquid-gas system 



The equilibrium between a liquid phase and a gaseous phase is just another example for a chemical equilibrium. We 
can write the equation that determines the coexistence curve in (T,P) diagram as \p, a (T, P) — fXb{T, P)\ = 0. By 
implicit differentiation of this equation with respect to T we get the Clausius-Clapeyron relation 



dP 

~dT 



drj^a -Mb] _ ^S_ _ !_ [Latent heat] 
coexistence d P [fi a - /j, b ] AV T [Volume change] 



Outside of the coexistence curve either /i a or fif, are smaller, and accordingly all the particles occupy one phase only. 



[5.6] Site system 



The chemical potential can be calculate easily for a system of N identical particles that occupy a set of M sites (or 
modes) that have the same binding energy e. Since we assume that the biding energy is the same for all sites, it follows 
that estimating Z\ is essentially a combinatorial problem. We assume n > 1 so we can approximate the derivative of 
ln(n!) as ln(n). We also write the result for the most probable n which is obtained given /i. Note that the result for 
fi is meaningful only for large M. 




Fermionic site:— Each site can have at most one particle 

Zn = , r .f ! ,, e~ 0sn (218) 

n\{M — n)\ 



» = e + Tln^ — ) (219) 

n = M(e^ £ -^ + l)- 1 (220) 

Bosonic site:— Each site can have any number of particles. The combinatorial problem is solved by asking how 
many ways to divide n particles in a row with M — 1 partitions. If the particles were distinct the result would be 
(n + (M — 1))!. Taking into account that the particles are indistinguishable we get 

(n + M-l)\ pen 
n n!(M- 1)1 [ ' 

^ £ + rln ( (M-l )+ n ) ^ 

n = (M-l)(e /3(£ -' i) -l)- 1 (223) 

Electromagnetic mode:— Each mode of the electromagnetic field can be regarded as a Bosonic site that can occupy 
photons with binding energy u. Since n is not constrained it follows formally that 

H = (224) 

n = (c^-1)- 1 (225) 
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Boltzmann approximation:— Assuming dilute occupation (1 <C n <C M) we get a common approximation for 
both Fermi and Bose case: 



M n 



Z n = -—c-^ n (226) 



n 



,j = e + Tln(^) (227) 



n 



= Mc-^ 6 -^ (228) 



General system of sites:— If we want to consider the partition function of N particles in M sites that have different 
binding energies we have to calculate 

Zn{P) = Y, e -^ (£ini+ - +£MnM) (229) 

n 1 + ...+n M = N 

Because of the constraint the sum cannot be factorized. We therefore adopt the "Grand Hamiltonian" strategy and 
calculate the Grand partition function Z(/3,/j,) that corresponds to Hq = Ti. — /iN. In principle we can get Zjy(/3) 
from Z((3,fj,) via an inverse transform, but in practice it is more convenient to stay with the Grand Hamiltonian 
framework. 



[5.7] The grand canonical formalism 



We can regard the grand canonical formalism as a special case of the canonical formalism, where the Grand Hamilto- 
nian %q =H — pN describes a Grand system that consists of the gas particles and a hypothetical reservoir. Optionally 
we can motivate the introduction of a the grand canonical formalism following the same justification strategy as in 
the case of the canonical formalism. First we have to specify the many body eigenstates R of the system: 

N\R) = N R \R) (230) 

H\R) - E R \R) (231) 

Then we assume that the system can exchange particles as well as energy with the environment. The probability of 
a many body eigenstate R is 

e -/3-E B £e_nv 

PR = Z „+!L N ' with Z W-n « ^ N (232) 

We deduce that 



PR ■- z 



l _ c -fl(E R -nN R ) (233) 



where the normalization constant is 

Z(0,n) = ^ c -^( B «-^«) (234) 



The Grand Canonical Z(fi,\x) is defined in complete analogy with the canonical case as sum over the many body 
states "R". For some purposes it is convent to write is as a function Z(z; 0) of the fugacity: 
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The Grand Canonical Z((3,fi) can serve as a generating function as follows: 

N^(N) = J2PnNn = - R ^ (236) 

Equivalently 

F G (T,V, M ) = -^lnZ (239) 

N = -^ (240) 

dF r 
S = ~^f ( 242 ) 

E = F G +TS + ^N (243) 

In the thermodynamic limit Fg is extensive, also in the case of non ideal gas. Consequently 

F G (T,V,n) = -VP{T,n) (244) 

S N 

dP = ydT+-dfi (245) 

In other words rather then using the notation Fq, we can regard P(T,/i) as the generating function. Note that this 
is the "Grand canonical" version of the "canonical" Gibbs function relation 

d/i = -^dT+^dP (246) 

For constant T, a variation in the chemical potential is related to a variation dP = nd\x in the pressure, where 
n = N/y is the density. It is common to define the compressiblity of the gas as follows: 



1 dn 1 dn 

n dP n 2 dji 



«r = z^ = -2- (247) 



[5.8] Fermi occupation 



A site or mode can occupy n = 0, 1 particles. The binding energy is e. the site is in thermochemical equilibrium with 
a gas in temperature /3 and chemical potential /x. 

(248) 
(249) 

(250) 



JV„ 


— 


n 




E n 


= 


IK 




Pn 


= 


1 

z c 


-/9(«-f*)n 
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and accordingly, 

Z((3,n) = (l+c-^-^) (251) 

N(0,n) = (n) = 5>„n = -^_L_ = /( e - M ) (252) 

n 

£(/3,m) = (ne) = e/(e- M ) (253) 

We have denned the Fermi occupation function < /(e — /«) < 1 

= [5.9] Bose occupation 



A site or mode can occupy n = 0, 1, 2, 3... particles. The binding energy is e. the site is in thermochcmical equilibrium 
with a gas in temperature j3 and chemical potential /i. 

N n = n (254) 

E n = ne (255) 

Pn = ^c-^-^ n (256) 

and accordingly, 

Z((3,n) = (l-c-^-^y 1 (257) 

N{0,n) = (n) = ^>„n = _^_L_. = /( c - M ) (258) 

n 

£(/3,//) = (ne) = e/(e- M ) (259) 

We have defined the Bose occupation function < f(e — n) < oo. If e < \x then (n) — > oo. If e = /i then the site may 
have any occupation. If e < /i then (n) is finite. 

= [5.10] Bosonic mode occupation 



The occupation of a mode of vibration, say the number photons in an electromagnetic mode, or the number of 
phonons in a vibration mode, are described by the canonical ensemble, by can be optionally regarded as described by 
the grand-canonical ensemble with u = 0. With slight change in notations we have: 

N n = n (260) 

(261) 

(262) 



E n = nuj 






Pn ^C 






and accordingly, 






z^) = (l-c-^r 1 






N(p) = (n) = Y,Pnn = 


1 


= /M 


e?" - 1 " 


E(/3) = (ntu) = L0f{u) 







(263) 
(264) 

(265) 
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[6] Quantum ideal gases 
= [6.1] Equations of state 



In what follows, unless written otherwise e = is the ground state and 

/>OC 

E -+ / g( £ )* ( 266 ) 







The stationary states of the multi particle system are occupation states 

\n) = |ni,n 2 ,n 3 ,...,n r ,...) (267) 

where n r = 0, 1 for Fermi occupation and n r = 0, 1, 2, 3, 4, ... for Bose occupation. For these states we have 

N n = J2 n r ( 268 ) 

r 

E n = ^n r £ r (269) 

r 

p n oc e^^^^"- (270) 

which can be factorized. This means that each site or mode can be treated as an independent system. We use E 
and N without index for the expectation values in an equilibrium state. For the Fcrmionic and Bosonic case we have 
respectively (±) 

/>oo 

InZ = ±Vln(l±e-' 9(£ -^) = /3 / Af{e)de /(e - /*) (271) 

r Jo 

TV = ]T(n r .) = ^ /(e r - /i) = / g(e)de /(e - /*) (272) 

/•OO 

£7 = Ve r (n r )=V/(e r -M)e r = / g(e)ede f(e- u) (273) 

r r Jo 

1 1 7 1 1*00 

P = ^^r = y] -^) & /( £ -M) (274) 

It is good to remember that P(T, /j.) can serve as a generating function for all other state equations. This would be 
true also if the gas were not ideal. In particular N/V — dP/d[i relates the density to the chemical potential, which 
implies a relation between the the pressure P and the density N/V. 

= [6.2] Explicit expressions for the state equations 



We assume one particle states \r) that have the density 

g(e) = Vce"- 1 , N(E) = - e g(e) (275) 

a 

For a particle in d dimensional box a = djv where v is the exponent of the dispersion relation e ex \p\ v , and c is a 
constant which is related to the mass m. For example, in the case of spin 1/2 particle in 31? space we have 

g(e) = 2 x V^ e i (276) 
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The following integral is useful: 

/•DC 



F a (z) 



o 



e x ± 1 I x a ~ 1 dx 



±r(a)Li Q (±z) 



(277) 



where the ± refer to Bose and Fermi case respectively. Details of the Polylogarithm function Li Q (z) can be found in 
Wikipedia. In the physics community it is commonly denoted as g a (z). Note that 



oo _. 

U a (z) = ]T^ 



z + 



Li Q (l) = C(a) 



(278) 



FaW 




For larger values of z the function F a (z) grows faster in the case of a Bose occupation. In the latter case it either 
diverges or attains a finite value as z — > 1, namely F a (l) = T(a)(,(a). The latter expression gives a finite result for 
a > 1. In particular we have T(3/2) = i/tt/2 and C(3/2) ~ 2.612. In the Fermi case the integral is always finite. Using 
the step-like behavior of the Fermi occupation function we get for z>l 



F a (z-»l) 



1 



(lnz) a 



(279) 



We can express the state equations using this integral, where z is identified as the fugacity. We get 
N 



V 

E 

V 



= cT a F a (z) 

= cT a+1 F a+1 (z) 



(280) 
(281) 



while P is related trivially to the energy: 



P 



1 /InZ 



-(- 

a I V 



(282) 



The specific results in the case of a spin non-relativistic Bose particles are: 

1 



N 

V 

E 
V 

P 



T3 Li 3/2( Z ) 

Arp 

3 T 
2 A^ 

2 (E 

3 I V 



:) Li 5/2(^) 



T 

\3 U 5/2{z) 



(283) 
(284) 
(285) 



For more details see Huang p.231-232;242. 



37 



= [6.3] Ideal gases in the Boltzmann approximation 

Wc take e = as the ground state energy of the one-particle states. The Boltzmann approximation is 

f(e-n) » e-^~^ (286) 

It holds whenever the occupation is /() <C 1. If it is valid for the ground state e = 0, then it is valid globally for all 
the higher levels. Accordingly the validity condition is: 



z = c^ < 1 => Li(z) w z 

In the case of standard 3D gas the Boltzmann approximation condition can be rewritten as 
TVAf. < V ~» T>T C (Bosons), ~> T > T F (Fcrmions) 



(287) 



(288) 



We shall defined the condensation temperature (T c ) and the Fermi energy (T F ) is later sections. Within the framework 
of the Boltzmann approximation we can re-derive the classical equation of an ideal gas: 



N 

V 

E 
V 



1 
3 T 



i 



^/T 



3N r 
2V 



,-T 



T N 
P = 2 = T 



(289) 
(290) 
(291) 



Note that within this approximation E and P do not depend on the mass of the particles. 




g(E) 



H 




boltzman 
bose 



f(E-H) 



= [6.4] Bose Einstein condensation 

Let us write again the general expression for the occupation of the orbitals: 



N 

V 



(Am) 



c i e~ x dt 



i 



3 /3(e-M) - 1 



(292) 



we get 



N 

V 



(/3,M^0-) 



oo for a < 1 



(293) 
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while for a > 1 only a finite fraction can be accommodated: 

N 



V 



(/3,,U ->{)-) = cT(a)((a)T a for a>\ 



(294) 




for oc<:l 



H 




Tj <T 2 <T 3 




!-l 



In the latter case, of a > 1, having /i = implies 



for l<x 



(l<a) 



N 
V 



H 



f(/3,M=0) = ^+cr( a )C(«)T a 



(295) 



while 



E = cVr(a+l)C(a+l)T° 



1 (E 



(296) 
(297) 



In particular the standard results for condensation in 3D arc 



N = n + 



v <(i)(s) ,r ' 



" = 'l5 (S>-" 



(298) 
(299) 



The pressure P is independent of the total number of particles, because the condensate does not have any contribution. 
Hence the compressibility k oc (dP/dV)^ 1 = oo. If we change the volume the extra/missing particles just come from 
the ground state, which is like a reservoir of /j = particles. 

Given T, if we push N particles into a box, the condition to have condensation is TV > N(/3, fi —¥ _ ). The condensation 
temperature, below which \i = 0, is 



T,= 



1 



N 



cT (a) C (a) V 



l/a 



1 



(300) 
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where i is the average distance between the particles. Given N, if one tries to eliminate /j,, and writes it as a function of 
T, then one observes that below the condensation temperature /i is forced to become zero. Under such circumstances 
all the particles that cannot be occupied in the excited states have to condense in the ground state: 



Wo 
V 



N N (R 



= 1- 



N 

V 



(301) 



The common phrasing is that a macroscopic fraction of the particles occupies the ground state. This fraction is 
determined by (T/T c ) a or equivalently by V/At- Note that T 3> T c is an optional way to write the Boltzmann 
condition. 



[6.5] Fermi gas at low temperatures 



At zero temperatures the Fermi function is a step function. At finite temperatures the step is smeared over a range T. 
In order to find explicit expressions for the state functions we have to perform an integral that involves the product 
of /(e) with a smooth function g(e). The latter is the density of states g(e) if we are interested in N, or eg(e) if we 
are interested in E. There is a simple procedure to get an approximation say to second order in T. For this purpose 
we first define the zero temperature result 



G{u) = 



g{e)de 



(302) 



And then proceed with the finite temperature calculation using integration by parts: 



de g(e) /(e - fi) 



de G(e) [-/'(e - „)] 
dc 



deG(e) S T (e- n) 



G{n) + G'(p)(e - y) + -G"(/i)(c - m) 2 + 



G( M ) + Y T 2 g'(f,) + 0(T 4 ) 



S T (e- (J,) 



(303) 
(304) 
(305) 



We can apply this formula to the N = J\f(p) calculation. First we do the zero temperature integral, and from it 
eliminate /ias a function of N. This zero temperature result is known as the Fermi energy \x = £f- Then we 
substitute \i = ep + 5/i in the above second order expression, expand G(jj) w G(ep) + g(ep)dfi , and find 



MT) 



e F 



* %!{zf) t 2 

6 g(CF) 



0(T 4 



(306) 



E 







\i 



g(E) 




f(E-M-) 



40 




Ideal Fermi 

Liquid 



V- 



The specific result for N in the case of a power law DOS is: 



N = -cV/j, a I l + a(a-l) — (- 
a \ 6 \/i 



(307) 



leading after elimination to 



(f 



aN 
c V 



M = 1 - (a-1) 



TT 2 T 



6 V ep 



cf 



(308) 
(309) 



For "spinless" non-relativistic fermions in 3D we have 



1 3 3 / TT 2 (T 



.,3 1 ,„ x a 6 / 5tH (T 



(310) 
(311) 



leading to 



(f 



1 
2m 



6tt 2 



,iV 
V 



7T 2 /_T 

12 I CF 



E 



57T 2 

l2~ 



\ 3 



:Ne F 



(312) 
(313) 

(314) 



Note that in zero temperature, the pressure given by the equation 



5 v ; m V 



(315) 
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Systems with interactions 



[7] General perspective 



Energy scales:— With regard to the prototype models of systems with interactions there are generically two energy 
scales. One is the interaction strength e, and the other is the temperature T. For T>ea pcrturbative treatment is 
appropriate. See below the cluster expansion. For T <C e it is advised to re-model the system with Hamiltonian that 
describes its collective excitations. The interesting regime is T ~ e where the phase transition takes place. 

Models of interest:— It is natural to start with the discussion of gas-liquid phase transition and to realize that 
its "lattice gas" version is formally equivalent to "Ising model" that describes a ferromagnetic phase transition. Its 
generalization is known as the "Potts model". The system consists of Af sites. At each site there is a "spin" that 
can be in one of s possible states. The Ising model is a special case with s = 2, and the interaction is ctjCj, where 
a = ±1. The "Ising model" has a discrete up/down symmetry. Its Field theory version is known as the Landau- 
Ginzburg model. The Heisenberg model is a different lattice model that has Si • Sj interaction. This interaction 
has a continuous symmetry with respect to rotations. Its 2D version is known as the XY model. There are also 
corresponding Field theory models that are known as non-linear sigma models. 

First order phase transition.— There are systems where there are (say) two families of states, such that each 
family has different DOS with different minimum. In such case a control parameter (call it h) might induce a crossover 
from the dominance of one family to the dominance of the second family. This crossover is reflected in the partition 
function and hence in the heat capacity and in the state equations. In the thermodynamic limit the crossover might 
be abrupt. In such case it is a "first order phase transition". If a change in a parameter leads to a bifurcation in the 
calculation of the partition function, it is called a "second order phase transition". The prototype example for phase 
transition is ferromagnetism where the magnetization might be "up" or "down". 

Order parameter.— In order to analyze a second order phase transition it is useful to identify the "order parameter" , 
which is a field if(x) that describes the coarse grained state of the system. In the prototype example of ferromagnetism 
it is the magnetization density in the sample. Defining an entropy functional S[ip] that reflects the number of 
microscopic states that have the same field configuration, we can express the partition function as 

Z = Y,e~ A[v] , A[p] = ±[%>]-T%]] (316) 

v 

Symmetry breaking.— Second order phase transition is due spontaneous symmetry breaking leading to long range 
order. At T = the definition of symmetry breaking is very simple. It means that E[ip] attains (say) two distinct 
minimum that are described by different field configurations (different "order"). However, at finite temperature the 
canonical state is not the minimum of the energy functional E[p] but of the free energy functional F[p] = E[p] — TS[p]. 
Accordingly entropic contribution may wash away the broken symmetry. 

There is an implicit assumption with regard to the possibility to observe "symmetry breaking". It is assumed that 
tp{x) has slow dynamics. If a magnet is prepared (say) with "up" magnetization then it takes a huge time until it 
flips to the quasi degenerate "down" magnetization. 

Long range order.— In the prototype examples at high temperatures there is no "order" meaning that the correlation 
function g(r) = (ip(r)ip(O) decays exponentially. As the critical temperature is approached from above the correlation 
length diverges. Below the critical temperature there is "long range order" and the correlation function should be 
rc-defined with respect to the new order. There is a possibility to witness "infinite order phase transition" where 
below the critical temperature there is no long range order, but instead the correlation function become powerlaw. 
See discussion of the XY model. 

Formal analysis.— Disregarding a few models that possess exact solutions, the analysis of the partition function 
can be done by adopting the following stages: (1) Primitive mean field theory evaluates the partition function by 
calculating A[(p] for the field configuration that minimizes it. This corresponds mathematically to an inferior saddle 
point approximation. (2) Proper mean field theory is based on proper saddle point approximation, which means 
that the calculation takes into account the Gaussian fluctuations around the minimum. (3) Rcnormalization Group 
(RG) treatment is required in the critical regime, whenever the Gaussian approximation in not valid. It explains the 
scaling anomalies that are witnessed in the vicinity of the critical temperature. 



[7.1] Gas of weakly interacting particles 



Consider a classical gas of interacting particles: 
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N „2 



2m 

z=i (ij) 



(317) 



The partition function without the Gibbs factor is 



Ziv(/?,V) 



\rp 



s 



A 



dxi...dxN exp 



-/^E u ( 3 



(ij) 



-3- I / dx 1 ...dx N TT (1 + / (ary)) 



/(r) = e 



-/3u(r) 



(318) 
(319) 



Note that the configuration space integral has the dimensions of V^. It equals \/ N if there are no interaction. If there 
are interactions we can regard the f(r) as a perturbation. The we get an expansion 



ZjvGS.v) = 



V 



N 



I + coef 2 [ — J + cocf 3 I — 



(320) 



From here we can derive the so called Virial expansion for the pressure: 



NT ^ , m (N 
1=1 v 



NT 



, N\ (N 



(321) 



The a^ are known as the virial coefficients. Note that the leading order virial expansion is consistent with the 
Van-der-Waals equation of state provided one makes the identification a 2 = b — (d/T) , namely, 



P 



NT _ (N 

= — a \ — 

V - Nb \V 



(322) 



[7.2] The Virial coefficients 



It is simplest to calculate the Virial coefficients using the grand canonical formalism. From now on the dependence 
on the temperature is implicit, and we emphasize the dependence on the fugacity z. The grand canonical partition 
function using the Gibbs prescription is 



N=0 



N 



[Here Zjy is defined without Gibbs factor] 



(323) 



For an ideal classical gas all the Zjv are determined by the one-particle partition function, namely Zjy = Zf. Ac- 
cordingly ln(Z) includes a single term, namely ln(Z) = Z±z. It makes sense to assume that interactions and quantum 
effects will add higher order terms. Hence we postulate an expansion 



lnZ(z) 



rB„z" 



n=l 



V^& n (T) 

71=1 



\rp 



(324) 
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where b\ = 1, and b n has the dimension of length™ -1 . Note that for an ideal Bose or Fermi gas one obtains 
bi = (±l) e+1 £~ 5 / 2 (Ay)™ -1 . We would like to find a procedure to determine these coefficients if there are weak 
interactions between the particles. Once they are known we get the state equations from 



d 

N = z—lnZ 
oz 

T 
P = -InZ 



(325) 
(326) 



leading to 



v - X>» (r > (i) 



P 

T 



X>cn 



Arp 



(327) 
(328) 



It is customary to eliminate z/\^ from the first equation and to substitute into the second equation, getting 



ai = b\ = 1 
a 2 = -b 2 
a 3 - -tu 2 



Abl - 26 3 



(329) 
(330) 
(331) 



It is implied from the discussion in the next section that in order to get b 2 we just have to calculation the two-body 
partition function Z 2 . Namely: 



b 2 = V 



A 



- 1 



[no Gibbs prescription here! 



(332) 



The calculation of Z 2 for two interacting quantum particles, given the scattering phase-shifts, has been outlined in a 
past lecture. In the classical case it is standard to assume that the gas particles are like hard spheres, each having 
radius R, with some extra attractive part that has depth ~ eq, similar to Lenard- Jones potential. Accordingly in 
leading order 



b 2 = 



-Pu(r) 



d 3 r 



1- 



£0 

T 



V(2i?) = - 



b- 



T 



(333) 



where V(r) is the volume of a sphere that has radius r. This gives interpretation for the phenomenological constants 
a and b in the Van-der-Waals equation of state. In particular note that b is the excluded volume multiplied by 2 , 
where d = 3. Contrary to a common misconception it is only in ID that b equals the excluded volume. 



[7.3] The cluster expansion 



Our objective is to calculate the b n coefficients. These are scaled versions of the coefficients B n in the expansion of 
ln(Z), and they are related to the coefficients 7-n is the expansion of Z. The relation between them is formally the 
same as the relation between commulants and moments in probability theory: 



Zi = 


= Bi 


z 2 = 


= B? + B 2 


z 3 = 


= B? + 3B!B 2 + B 



(334) 
(335) 
(336) 
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Or backwards: 



Bi = Zj (337) 

B 2 = Z 2 -Z? (338) 

B 3 = Z 3 -3Z 2 Z 1 +2Z? (339) 

We can use these relations both directions: First we can evaluate a few Zjy, typically ~L\ and Z 2 , in order to get the 
leading order B n coefficients, say Bi and B 2 . Once the leading order B„ coefficients are known, we can generate from 
them a generalized Gibbs approximation for all(!) the Zjv- 

Diagrammatic procedure.— We turn to outline a general diagrammatic procedure to evaluate the bg for a classical 
gas of interacting particles. A graph (network, diagram) is a set of vertices (nodes) that are connected by edges 
(connectors, bonds). In the present context each diagram represents an integral. The sum over all the connected 
diagrams that have n nodes gives the expansion coefficient B n of the "comulant" generating function ln(Z), while the 
sum over all diagrams (including reducible diagrams) gives the expansion coefficient Z^r of the "moment" generating 
function Z. Formally we write 



1 \ N f ( 1 \ N 

-g- J / dxi...dxjM TT (1 + / (xij)) = ( T3- ) /, [diagrams with iV nodes] (340) 



A/ ' ' ; («> 



In this expression a diagram represents an integral of the type 

C[3',l,2,3] x f[f(x 12 )f(x 23 )} [f(x 56 )} lf(x 78 )f(x 89 )f(x 97 )} d Xl ...dx 9 (341) 



where C is a combinatorial factor that arise because we identify diagrams that differ only in the labeling of 
the vertices. One should realize that if a diagram is reducible, say TV = m + n 2 + TI3, then C[ni, 112,113] = 
[7V!/(ni!n 2 !n3!)]C[ni]C[n 2 ]C[n3]. In the above example C[3'] = 3 is the number of ways to have a triangle with 
2 bonds, while C[3] = 1. Using this observation it is not difficult to prove that 

B„ = I -3- J \J [connected diagrams with n nodes] (342) 

The expression for the b n with prcfactor l/(n!V) is implied. The expressions for the leading coefficients are: 

&i = - J dx = 1 (343) 

h = ^ J f(x 12 ) d Xl dx 2 = ^jf(r)dr (344) 

b 3 = ^r; [3f{X!2)f(X23) + f(xi2)f{x23)f{x 3 i)} dxidx 2 dx3 (345) 



[7.4] From gas with interaction to Ising problem 



Consider classical gas with interactions U {f[, ...,rjv). The N particle partition function is 

Z N = ^(l) 3N fd™re-^-^ (346) 

We see that the kinetic part factors out, hence the whole physics of the interactions is in the configuration integral. 
Therefore, without loss of generality we can consider "static gas". To further simplify the treatment we consider a 
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"lattice gas" version: 



% = U (f[, ..., r]v) = - 2_^ u ( x > x> ) n ( x ) n {x') 



(347) 



The grand partition function is 



z = ^2 cx p 



^(•) 



~^ ^^u(x,x')n(x)n(x') -^^n(a 

\ a:, a:' cc 



(348) 



where n(x) = 0, 1. We define 

l + cr(x) 
n(x) = 



(349) 



where cr(x) = ±1. Then we get 



z = ^2 ex p 



a(x) 



E 



£ (a;, x 1 ) <j{x)a (x 1 ) — h N^ ^(x) + const 



(350) 



where e = u/4, and h = [fj, — u]/2. Here u is the interaction energy per site for full occupation. Note that h = 
implies that a fully occupied lattice has the same energy as an empty lattice. We represent graphically the interaction 
s(x,x') between two sites x and x' by "bonds". The notation (x, x') means from now on summation over all the 
bonds without double counting. In the simplest case there are interactions only between near-neighbor sites. 

We see that the calculation of Z for static lattice gas is formally the same as calculation of Z for an Ising model. The 
following analogies should be kept in mind 



occupation N 

chemical potential fi 

fugacity z = c^ 

^rand canonical Z(j3,/i) 



magnetization M = 2N — Af 
magnetic field h 
define z = e 2 ^ h 
canonical Z(/3, h) 



(351) 
(352) 
(353) 
(354) 



From now on we refer to Ising model, but for the formulation of some theorems in the next section it is more convenient 
to use the lattice gas language for heuristic reasons. Note also that N is more convenient than M because it does not 
skip in steps of 2. 



= [7.5] Yang and Lee theorems 

Consider the lattice gas or the equivalent Ising model. The probability of a given configuration is 

P R oc [e-^«] z N * 
where En is the energy of the interactions, and z is the fugacity. The partition function is 



(355) 



Z(z;(3) = ]TZ w (/3); 



N 



(356) 



N=0 
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The Helmholtz function is 

F(z;0) = -^\nZ{z;0) (357) 

The expectation value of the total magnetization is 

(N) = -0z^F(z;0) (358) 

As we increase z we expect the magnetization (N) to grow. We expect (N)/Af to have a well defined value in the 
limit Af — > oo. Moreover, below some critical temperature we expect to find a phase transition. In the latter case 
we expect (N) to have a jump at zero field (z = 1). The Yang and Lee theorems formulate these expectations in a 
mathematically strict way. 

Given Af it is clear that we can write the polynomial Z as a product over its roots: 

M 
Z(z) = const x TT(z-Zr) (359) 

r=l 

Consequently 

1 N 
F(z) = --^]hi(z-z r )+ const (360) 



and 



' r=l 



(AT) = ZJ2- 1 - (361) 



=i 



^ • 



There is a strict analogy here with the calculation of an electrostatic field in a 2D geometry. In the absence of 
interactions (infinite temperature) we get that all the roots are at z = — 1. Namely, 



Af 



Z(z;p) = ^C#z iV = (1 + z)" (362) 

AT=0 

So we do not have phase transition since the physical axis is < z < 1, where this function is analytic. The questions 
are what happens to the distribution of the roots as we increase the interaction (lower the temperature) , and what 
is the limiting distribution in the thermodynamics limit (Af — > oo). There are three statements that give answers to 
these questions due to Yang and Lee. The first statement is regarding the existence of the thermodynamics limit: 

lim — — — = exists (363) 

The second statement relates specifically to the standard Ising model, saying that all the roots are lying on the circle 
\z r \ = 1. In general other distributions are possible. The third statement is that below the critical temperature the 
density of roots at z = 1 becomes non-zero, and hence by Gauss law (TV) /Af has a jump at zero field. This jump is 
discontinuous in the thermodynamic limit. 
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M 



T<T 








T<T C 






■^p- 




\ / 


V 




h<0 


\ 

h>() 



In the (T, /x) phase diagram the discontinuity is indicated by a solid line that stretch form (0,w) to (T c ,u). However 
in practice it is customary to use a (T, P) phase diagram. Here we bring the phase diagram of water as an example 
[Wikipcdia] : 



solid phase 




/ canpressihle 
liquid 


supercritical fluid 


critical pressure 

Per 




liquid / 
phase / 


critical point 


P, p triple point 




vapour 


gaseous phase 
critical 




Tip 




temperature 

Tcr 

^ 



Temperature 



In the grand-canonical perspective N as a function of /x exhibits a jump at /x = u. In the canonical perspective 
we fix TV and solve for /x. Then we get a "gas" or a "liquid" state provided the N line does not intersect the 
discontinuity. Otherwise there is a phase separation, where the gas and the liquid phases coexist with the same 
chemical potential /x — u. 
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[8] The Ising model 



[8.1] Model definition 



The energy of a given Ising model configuration state is 

E W (')] = - Yl £(x,x')a(x)(r(x')-^2h(x)a{x) 

(x,x') x 

The canonical state is 

Pk(-)]-^e-^«] 



where the partition function is 



Z[h(-),0\ = J2 ex P 



/3 y, £ {x, x) a(x)a (x) + > ^ h(x)a(x) 

(x,x') x 



We expand the Helmholtz function as 

F[/i (•) , T] = F Q (T) - -L ^ g (x, x') h{x)h (x') + O (h 4 ) 



(364) 



(365) 



(366) 



(367) 



From here we get 



(a(x)} 



(a(x)a(x')) = -T 



f^29{x,x')h(x') 

= g(x,x') 



dh{x) T 

OF 



dh{x)dh (x') 
for an homogeneous field we replace F[h (•) , T] by F (h, T) and we get 

BF 

M = -—=N X h + 0(h*) 

with the " fluctuation- response relation" 



^E*w 



(368) 
(369) 



(370) 



(371) 



Unless indicted explicitly otherwise F(T) is calculated for zero field. We have the usual expressions: 



E(T) 

S(T) 

C(T) 



F (T) + TS (T) 
^dF(T) 
dT 

dT dT 



(372) 
(373) 

(374) 
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[8.2] The spatial correlation function 



It is possible to measure g(r) via a scattering experiment. Given a configuration a(x) the intensity of the scattering 
in the Born approximation is 



7(g) a 



a(x)e' ll i- s dx 



(375) 



If we average over configurations we get 



I(q) oc / dxdx' {a{x)a {x'))c- iq < S - S ') oc g(q) 



(376) 



where g(q) is the FT of the correlation function. We shall see that Landau's approach in the Gaussian approximation 
leads to the Ornstein-Zernike expression 



Kl) (X 



This leads to 



1 



q 2 + (I/O' 



(377) 



](r) ~ exp(-r/£) if £ < oo 
j(r) ~ l/r d - 2 if £ = oc 



(378) 
(379) 



The information about order-disorder transition is in g(r). If £ < oo there is no long range order, and we get 

ij>(r) < co (380) 



T 



as £ — > oo the susceptibility diverges which suggests spontaneous magnetization as phase transition. Below the critical 
temperature one should define the correlation function as follows: 



7 (r) = {a{x)a{x'))-{a{x)f 



(381) 



where r = \x — 



= [8.3] Critical behavior and the scaling hypothesis 

Below we display the phases diagram in (T, h) space, and qualitative plots of the state equations. For the 2D Ising 
model with near neighbor interactions T c = 2.27e. 

hA b, — w t 



o 
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The state equations in the critical region are characterized by the exponents a, /J.7, 5 (see below). Two other exponents 
v and 77 arc defined via the critical behavior of the correlation function, which is assumed to be a variation on the 
Ornstcin-Zcrnike expression. Namely, one conjectures that the divergence of the correlation length as T — >• T c is 
described by 

£ ~ \T-T C \~ V (382) 

and that the correlation function is 

9(r) ~ ^J^ exp(-r/0 (383) 

Here we combined the T = T c and the T > T c into one expression. This expression describes the long range behavior. 
Note that the "microscopic" short range behavior is not too interesting because it is bounded by g(0) = 1. The 
divergence of \ is due to the slow power-law tails. Below T c the behavior is similar to T > T c provided the correlation 
function is properly defined. Going on with the same phenomenology the conjecture is that away from T c the 
correlation distance £ is the only relevant length scale in the thermodynamic limit. This means that each "correlated 
block" of the system has the same contribution to the Free energy irrespective of the temperature, hence 

F (T) ~ ^ ex \T-T c \~ vd (384) 

where d is the dimensionality. It is customary to define the scaled temperature as t = \T — T c \/T c , and to summarize 
the above conjectures as a scaling hypothesis that involves the critical exponents: 

F {s- 1 l"t) = s- d F (t) (385) 

g{sr, s~ 1/v t) = s-^- 2+T,) g(r,t) (386) 

From here it follows that 

Cis- 1 '^) = s {2 '^- d C{t) (387) 

X {s- 1/V t) = s 2 -v x (t) (388) 

From the combined scaling relation 

F( S - 1 /^,s-( d + 2 -")/ 2 / l ) = s- d F(t,h) (389) 

we can deduce similar relations for the magnetization. These scaling relations allow to deduce the critical exponents 
a, /3.7, 5 from d, u, r), leading to 

C~ |i|~ Q , a = 2-ud (390) 

M~ \t\ , /3 = (d-2 + 7])iy/2 (391) 

X- t'\ 7 = {2-r,)v (392) 

M~ \h\^ s , S=(d + 2 + n)/(d-2 + n) (393) 

The so called "classical" mean-field exponents that wc derive later are 

v = 1/2, 7] = 0, a = 0, /3 = 1/2, 7 = 1, (5 = 3 (394) 

In order to get a non-trivial result for a we have to take into account Gaussian fluctuations around the mean field 
leading to a = [2 — (d/2)]. One observes that the classical mean-field exponents satisfy the scaling hypothesis based 
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relations if d — 4, but not for d = 3. This implies that we have to go beyond mean field theory in order to establish 
the experimentally observed scaling behavior. 



[8.4] Digression regarding scaling 



A function of one variable has a scaling property if 

F(sx) = s Df F(x) (395) 

where Dp is the scaling exponent. It follows that F(x) = const x Df . For example F(x) — x 2 has the scaling exponent 
Dp = 2. If we have say two variables then the more general definition is 

F(s Dx x, s D yy) = s Df F(x, y) (396) 

Note that the scaling exponents can be multiplied by the same number, and still we have the same scaling relation. 
It follows that there is a scaling function such that 

F ^y) = y DF/Dy f{-^) ( 397 ) 

For example F (x, y) = x 2 + y 3 has the scaling exponents D x = 1/2, D y = 1/3, Dp = 1. More generally any "physical" 
function has an "engineering" scaling property that follows trivially from dimensional analysis. 

== [8.5] Solution of the ID Ising Model 

Assuming only near neighbor interactions 

E[<j] = -e^aicrj -^hiGi (398) 

(ij) » 

The partition function is 

Z[h,0\ = J^e-^W (399) 

<t(.) 

For e — we get 

Z[h,P\ = Jj2cosh(/3^) (400) 



and hence 



N f (h \\ 1 N 

F[h,T] = -Tj2^hcosh(^))^-NT\n(2)-—J2h 2 (401) 



N ' 'h \\ 1 N 

| „-JVTln(2)-- 

i— 1 

The correlation function is 

9(r) - -T^- = 8 i0 = 6 r , (402) 
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and hence the susceptibility is 



rEs(' 



T 



The magnetization is 



1 
T 



(403) 



M 



dF 
'~dh 



N tanh — 
T 



N X h + O (h 3 



(404) 



We turn now to the case e ^ and assume homogeneous field h. This allows to use the transfer matrix method for 
solutions. Let us define the scaled parameters 



(405) 
(406) 



e = /3e 






h = (3h 




and the matrix 




T , n = 
J-cr'cr" — 


exp 


ea'<j"+ l -h{a' + cr") 



~e—h 



The eigenvalues of this matrix are 



A± = e B cosh (h) ± e~ e \ /l + e 4e ~ sinh 2 (h 



The partition function of Af site Ising model on a ring can be calculated as 

Z (0, h) = trace {T M ) = X* + \ M 
and hence for very large N we get 

F(T,h) = -AfT\n(\ + ) 
Expanding we get 

F(T,h) « -MTln (2 cosh (^)) ^Af ***^) h 2 



T 



T 



Hence 



1 ( n z 

X= j, exp(^2- 



Now we would like to calculate the correlation function at zero field. 
g (r) = (<7 a r ) = — ^ cr ^ 0<T ^ r T^- r 



(407) 



(408) 



(409) 



(410) 



(411) 



(412) 



(413) 



We have 



T„ 



1 1 

1 1_ 

V2 \/2 



with 



A+ 
A_ 



l l 

1 1_ 

V2 V2 



T).'-! 



(414) 



A + = 2 cosh (s) 

A_ = 2 sinh (e) 

Using standard Pauli matrix notations and denoting the digonalizcd matrix T as A we get 



;(r) = |trace [a z T r a z T^~ r ] = |trace [a x A r a x A^- r ] = 



For very large TV we get 
'A \ r 



A^ + A^ 



g{ r ) 



\.. 



= c-''/S 



(415) 
(416) 



(417) 



(418) 



where 



In I coth . 



a 2e/T 



(419) 



The calculation of ^2 g{r) involves a geometric summation, and it can be verified that it equals £. The result is as 
expected in agreement with what we got for %. 



[8.6] Solution of the 2D Ising model 



The full details of the Onsager solution for this problem is in Huang. Also here the transfer matrix approach is used. 
Recall that the zero field solution of the ID model is 



The 2D solution is 
1 \nZ = 



In (2) + In (cosh (e)) 



(420) 



In (2) 



1 



d6d$' 



In 



N " y, '2JJ (2tt) 

The integral is determined by the dimcnsionless parameter 
2 sinh (2s) 



(cosh (2s)) + sinh (2s) (cos 9 + cos 6') 



(421) 



(cosh(2f?)) 2 



< 1 



(422) 



The value k = 1, for which InZ exhibits discontinuity in its derivative, is attained for sinh(e) = 1, from which it 
follows that the critical value of the interaction is s — 0.44, leading to T c = 2.27e. This is close to the heuristic value 
T c w 2e that we are going to derive in the next lecture. 
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[9] Phase transitions - heuristic approach 



[9.1] The ferromagnetic phase transition 



The standard Ising Hamiltonian is 



(ij) i 



(423) 



Let us assume that in equilibrium we can regard the spins as quasi-independent, each experiencing an effective held 
h, such that the effective Hamiltonian for the spin at site i is %W = —hai. This means that the equilibrium state is 



P<n...a N <* CX P 



-0hJ2 ai 



(424) 



We have to find what is h. The formal way is to use a variational scheme. We shall introduce this procedure later. 
In this section we guess the result using a self-consistent picture. By inspection of the Hamiltonian if the mean 
magnetization of each spin is (a) , then it is reasonable to postulate that 



h = h + s >J (crj) = h + ce(a) 



(425) 



where c is the coordination number. Form w- 1 ' we get the self-consistent requirement 



(a) = tanh [ — (h + ce(a)) 



(426) 



This equation should be solved for (a), and then we get h as well. 
<G> A 




By inspection of the plot we observe that for h = the condition for getting a non trivial solution is ce/T > 1. 
Therefore T c = ce. If we want to explore the behavior in the critical region it is convenient to re- write the equation 
in the following way: 



h = J 'tantT 1 (a) - T c (a) 



(427) 



and to approximate it as 



h=(T-T c )(a) + -T c (a) 3 



(428) 
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for T > T c we get the Curie- Weiss law 

(a) = — - — h 
w T-T c 

hence we get the critical exponent 7=1. For T = T c we get 



(429) 



(*) = ( ¥ h 



hence we get the critical exponent 6 = 3. For T < T c we get 

i 



(430) 



T 



hence we get the critical exponent j5 = 1/2. 



(431) 



In the mean field approximation the spins are independent of each other, and therefore {&i<Jj) = (ui)(<7j). It follow 
that the energy is 



E 



(H) 



1 



cAf e {af 



(432) 



For the heat capacity we get 



C{T) 



dE 
~dT 



-ceAf (a) 



d(a) 



h=0 



dT 



(433) 



h=0 



For T > T c we get C(T) = which implies the critical exponent a = 0. To get the non-trivial mean-field result 
q = [2 — (d/2)] we have to take into account Gaussian fluctuations. 



AC(T) 




= [9.2] The ant i- ferromagnetic phase transition 

The Ising Hamiltonian with e n- — e 

(ij) k 

We mark the magnetization of the two sub lattices by M a and M&. We define 



(434) 



M = -(M a + M b ) 



M x 



(M a - M b ) 



(435) 
(436) 
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Without the magnetic field, the problem is the same as the ferromagnetic one with M s as the order parameter. With 
magnetic field h the heuristic mean field equations become 



tanh( ^(h-T c M b 



M n 



M b = tanh ( - (h - T c M a ) 



(437) 
(438) 



Following the same algebraic simplification procedure as in the ferromagnetic case, we get after addition and subtrac- 
tion of the two resulting equations, 



(T - T c ) M s + ^T c (3M 2 M S + M 3 ) = 
(T + T c ) M + -T c (3M S 2 M + M 3 ) = h 



(439) 
(440) 



From here it follows that (see graphical illustration): 

'T — T N 

l 



M s = or 3M 2 + M 2 = 3 



T 



(2 + M s 2 )M+ -M 3 



h 



(441) 
(442) 



As expected from the second equation we get M — in the absence of an external field, and from the first equation 
we get the order parameter M S {T), which satisfies the same equation as in the ferromagnetic problem. If we switch 
on the magnetic field T c is shifted to a lower temperature. 




A 



critical line 
(2nd order) 



T 



If the magnetic field h is strong enough, it destroys the anti- ferromagnetic order and causes M s = 0. This is implied 
by the identification the ground state: 



E(t4T4) = M"- [ -\ce j Jior weak fiel.ll 



E (ft It) = -AT • ( -ce - h ) , [for strong field] 



(443) 

(444) 



In the region where T ~ T c and /s~0wc get for the magnetization 



M = 



T c + T[l + \Ms{T)- 



h = xh 



(445) 



We can get a better general expression for all of the temperature range by differentiation of the heuristic equations 



X = 



T c + T cosh 2 C^M 8 (T)) 



(446) 



■57 



In the region T ~ T c substitution of M s (T) gives 



i T < T 

i T < T 

4T C -2T J ^ ^ c 



(447) 



= [9.3] The variational approach 

The canonical state minimizes the Free energy functional. Accordingly we look for a solution to the variation problem 

F [p] = (H) - TS [p] = minimum (448) 

with implicit constraint on the normalization. In the mean-field approach the canonical state is assumed to be well 
approximated by p = {pa}, where 



Vo\...o 



(2cosh(/3/i)) 



M 



exp 



-fih y^ a k 



(449) 



Here the variational parameter h is the effective mean field. We would like to determine the optimal value of h 
for which F [p] is minimal. For the calculation we use the identity F [p] = Fo [p] + (H — Ho), where Ho = —h^ i a, 
leading to 



F[p]=M 



-T In ( 2 cosh ( - 1 1 ce ( tanh ( - ) ) - (h - h) tanh 



(450) 



The variational equation for h is as expected 



h = h + ce tanh 



T 



(451) 



Hence, we get the variational free energy 

h 

f 



F(T,h) = Af ~T\u ] 2cosli 

and the magnetization 

dF (T, h) 



M 



1 ( uf h 

-ce tanh — 

2 V \T 



Oh 



dFdh Kr fh 

Mdh - A ' tanh T 



(452) 



(453) 



The above expressions for F(T, h) and M(T, h) are not convenient because the dependence of h on h is not explicit. 
To make calculations of state equations it is convenient to transform to 



A(T, M) = F(T, h) + hM 



(454) 



such that dA = —SdT + hdM. In order to make the change of variable we note that from the M(T, h) expression, 
and form the mean field equation for h it follows that 



h = Ttanh -1 (M)-ceM 



M 
where M = —7 = (a) 
Af 



(455) 
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Using the identity tanh (x) = (1/2) ln((l + x)/(l — x)) one obtains 



A(T,M) = Af 

= NT 



-Tln2 + -Tln(l-M 2 ) + -TM In ( 1+ w ) - -ceM 2 
2 v ' 2 \1- M I 2 



1 + M , 1 + M 1-M, 1 - M 

In 1 In 

2 2 2 2 



-Af-ceM 2 
2 



(456) 

(457) 



From this expression it is convenient to derive explicit results for the state equations. In particular S = —dF/dT and 
one can recover the result for the heat capacity. 



[9.4] The Bragg Williams formulation 



Consider an Ising model with Af sites, at any dimension, and with any coordination number. Given a spin configuration 
define 



Af = total number of spins 

M = total magnetization 

M = M/Af 

N + = number of up spins = -{Af + M) = -Af(l + M) 

N_ = number of down spins = -(M - M) = -M{1 - M) 

N + _ = number of bonds connecting spins with opposite direction 



(458) 
(459) 
(460) 

(461) 

(462) 
(463) 



It follows that 

(ij) 



M 

1 



cAf - 2N+. 



(464) 
(465) 



If we look on two connected spins, there is a probability (N+/AT) to have the first up, and a probability (N-/Af) to 
have the the second down. The total number of connected spins is cAf/2. The Bragg Williams approximation is 



N, 



cAf . 



AT 






iV_ 

aT 



AT 
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Assuming that it holds for typical configurations we approximate the energy functional as 



E[a] fc -Af x ( ^ceM 2 + hM 



(467) 



We note that this expression with c = Af if formally exact for a fully connected cluster of spins. The number of 
configuration with magnetization M is 



N\ 



9m 



const exp 



-A ( l -M 2 + 1m 4 



(N+y.(N-y. 

In order to derive the latter approximation note that the derivative of ln(<?M) is of the form 



(468) 



(ln(l + x) -ln(l -x)) 



1 3 1 H 

x + -x 6 + -x b 

6 5 



(469) 



With this approximation we get 
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z = EE c ^ Bkl 



M aeM 

A(M) 



N . 



Y,9M^ mM) = ]Tc- A(M) 



M 



M 



- (1 - f3ce) M 2 + -^M 4 - (3hM 



(470) 

(471) 



This sum can be evaluated via Gaussian integration. The dominant contribution comes from the M for which A(M) 
is minimal. Once can easily verify that A'(M) = coincides with the heuristic mean field equation that has been 
discussed in a previous lecture. The critical temperature is T c = ce. For T < T c the trivial mean field solution M = 
bifurcates into two stable minima. Symmetry breaking is implied because the limits h — > +0 and h — > — do not lead 
to the same minimum. 



[9.5] The importance of fluctuations 



One can ask whether it is allowed to ignore the quadratic term in A(M). This should be checked self consistently. 
Using the standard notation t = (T — T c )/T c , and assuming t > we get that the dispersion of M in the Gaussian 
approximation is (Aft)^ 1 / 2 . The quadratic term is negligible if \M\ 4 <C i|M| 2 leading to the condition \t\ ^> 1/A/" 1 / 2 . 
This condition is always satisfied in the thermodynamic limit. The same condition also guarantees that for t < the 
dispersion is much smaller compared with the non-zero mean field M « t 1 ' 2 . 

However, the above analysis is misleading. The Bragg Williams approximation underestimates the effect of fluctua- 
tions. We already know from the exact solution of the Ising model that in ID there is no phase transition at finite 
temperature. We would like to explain in detail why the fluctuations in ID smear away the phase transition. Later 
we shall see that also for d = 2,3 the fluctuations are important: they do not smear away the phase transition, but 
they modify the state equations in the critical region, which explains the failure of mean field theory there, and the 
observed anomalous values of the scaling exponents. 

The problem with the Bragg Williams approximation is implied by the figure below. In the T — panel the energy 
of the states is indicted by bars. By taking the "typical" value of the energy (thick solid line) we ignore a dominant 
fraction of less typical states that have a very low energy. These states corresponds to configurations where the spins a 
bunched in "zones". The simplest arrangement in ID has two zones and its energy is Eb = Eq + 2e, where Eq = —Me 
is the ground state energy. Consequently the effective barrier between the "all up" and "all down" states is very low 
(thick dashed line), and the symmetry breaking is avoided at any finite temperature. The formal argument is outlined 
below. In contrast to that, in 2D the effective barrier is Ej, = Eq +Af 1 ' 2 e and therefore symmetry breaking is realized 
in the thermodynamic limit. 



ilz \Jy 



T>\. 



T<\ 




Domain walls.— It is possible to argue that in ID there is no phase transition at finite temperature. The argument 
goes as follows: consider a mixture of states with one domain wall. For such state E[p] — Eq + 2e, where e is the 
cost of the domain wall. But the entropic contribution is S[p] = In [A/] where AT is the length of the chain. It follows 
that for any finite T the ground state does not minimize F[p] at the thermodynamic limit. Therefore spontaneous 
magnetization at finite temperature is impossible. 

The above argument fails in 2D because the energy of a domain is E[p] — Eq + Kf x ^ 2 e, which does not threat the 
stability of the ground state. In fact it is possible to refine the Domain wall approach and deduce that for the d = 2 
Ising model there exist spontaneous magnetization at finite temperature [see Huang p. 349], in consistency with the 
exact solution. 

The possibility to witness phase transition at d = 2 is limited to systems with discrete symmetries. The Merrnin- 
Wagner theorem states that a continuous symmetry cannot be spontaneously broken at finite temperature in d < 2 
systems with short-range interactions. See discussion of the Heisenberg model. 
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[10] Phase transitions - field theory 
= [10.1] The Landau- Ginzburg model 



We would like to take into account the spatial fluctuations of the magnetization ip(x) in the calculation of the partition 
function. Inspired by the Bragg- Williams approach we introduce the following field theory model: 

Z[h, parameters; A, L] = ^V A[v(-)1 = ^c^^ 1 (472) 

where the sum over configurations becomes an integral with the measure 

D V = J| dp x = J| difk , f>k = -£d / ^( x ) c ~ lkXdx ( 473 ) 

x£L d |fc|<A 

In this definition tpk are the Fourier components of <p(x), and we are sloppy about an uninteresting overall prcfactor. 
It is implicit that one assumes a finite volume L d , and a finite momentum cutoff A, otherwise the integral is ill defined. 
To have a momentum cutoff is like to assume that space is discretized with lattice spacing 2tt/A. Accordingly the 
number of freedoms of the model is M = (LA/2w) d . The assumed action is 

A[p(.)} = J dx^-iV^f + \r^ + \u^ -h^j = L d J2( K \(k 2 +r)Mk)\ 2 + -^ (474) 

Noe that the summation over the k components of the field is conventionally written as an an integral with the 
measure [L/(2ir)] d dk. The convention with regard to the prefactor of the first term fixes the dimensions of <p, and 
hence of the other model parameters. We write these dimensions as L d , accordingly 

d v = — , d h = y~ ' d r = -2, d„ = -(4-d), (475) 

The model has a thermodynamic limit, hence L is not significant, and we can calculate the Hclmholtz free energy F 
per unit length. In contrast to that A is significant, and accordingly we realize that the model contains two significant 
dimensionless parameters r = r/A 2 and u = u/A 4 ~ d . Relating to the Bragg- Williams approximation we identify 

r ex (T — Tc )/T, where T c is the mean field critical temperature. In fact we shall see that the field theory analysis 
implies that the actual critical temperature is pushed down due to the fluctuations (r c < 0). In the ID case the 
critical temperature is pushed to zero (r c = — oo). 

The Landau-Ginzburg model stands by itself as a prototype model. Note however that there is also a possibility 
to motivate it by direct reduction from the Ising model. For this purpose one replaces the discrete summation over 
a = ±1 by an integration over ip with a weight function: 

]r -». /n^e-M^-i) 2 (476) 

<t(.) J x 

One should realize that the ferromagnetic interaction — <t(x)o(x') corresponds to differences (ip(x) — (^(a/)) 2 , and 
hence translates to the gradient term in the Landau-Ginzburg model. 

The field theory that corresponds to the Ising model contains a real field. It reflects the discrete mirror (up/down) 
symmetry of the system. More generally we can consider a system that has a continuous rotational symmetry. In 
such case tp — (<pi, <P2, ^3) is a vector and the Action should be modified in an obvious way. Of particular interest is 
to have complex field ip = {<pi, 1P2) that describes a system with continuous gauge symmetry: this can be regarded as 
a rotational symmetry in two dimensions. 
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[10.2] Digression - Gaussian integrals 



The partition function can be calculated exactly whenever the action is a quadratic form. The so-called Gaussian 
integral reduces to the product of Af one dimensional integrals if we transform it to a basis in which the quadratic 
form is diagonal. For a system that has translation symmetry it is momentum space. 



Dip e - 3 Em Avvtvj+Et hin = Y[ d<p k 



e 2 



n© 



1/2 







I 


(477) 


V / det(2 7 rG) exp 


o / Gjjhjhj 


(478) 






i-j 





here G = A 1 , and note that dct(G) = 1/ dct(A). Note also that going back to the original basis, in the case of 
position-to-momentum transformation implies that G{r) is the Fourier transform of l/a(k). 

From the above result, it follows that G(r) is the correlation function (ip(r)ip(O)} for h = 0. Otherwise (ip) is non-zero, 
and it equals to field ip that minimizes that action. It satisfies the equation Aip = h, whose solution is ip — Gh. Hence 
G can be regarded as the " Green function" . 



[10.3] The mean field approximation 



We define the mean field ip via the equation A (ip) = minimum. The mean field for an homogeneous field h(x) 
obtained from the equation rip + wp 3, = h. In particular for h — ±0 we get 



h is 



yi) 



o, 



for r > 



±(=±Y , forr<0 



A[pa] = 



0. 



for r > 
for r < 



(479) 



More generally the mean field equation is 

(-V 2 +r)ip + wp 3 = h(x) 
The solution up to first order in h(x) is 



(480) 



(p(x) = (p + g( X -x')h (x') dx' + 0(h 2 ) 



(481) 



where 



1 (x — x ) = 



dq e iq ( x ~ x ') 



(2n) a qi + (1/0' 



s = 



r 2, 
(-2r)" 



for r > 
for r < 



(482) 



We recall that g(r) is both the Green function and the correlation function in the Gaussian approximation. Hence 
the mean field exponents are v = 1/2 and rj = 0. 



[10.4] The Gaussian approximation 



We assume r > 0, so the action has a single minimum at ip. Let us neglect the quartic term. Then the partition sum 
factorizes in k space. Not caring about global prefactor we get: 



Z = c 



-am 



n/# 



k exp 



-Y^ + tVO 2 )^ 



-am 



n 



i 



fc 2 + (i/o 2 



1/2 



(483) 
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For r < we have to add the contributions that come from two minima, but if h > the dominant contribution will 
come from the vicinity of the ip > minimum. Assuming that the result is dominated by a single minimum, either 
above or below the critical temperature, the free energy is 

F(T) = TA[0\ + |]Tln(fc 2 + (l/0 2 ) (484) 

k 

In particular we can derive from this expression the Gaussian prediction for the heat capacity. The "mean field" term, 
that originates from A[<p>], contributes a discontinuity. The singular contribution at the vicinity of T c comes from the 
second term, and originates from the second derivative with respect to r. Accordingly 

C(T) = -T^ ~ f A ^r^ ~ \T-T c \- {4 - d)/2 , ford<4 (485) 



dT 2 J (k 2 + r) 

Hence the modified mean field exponent a = [2 — (d/2)]. 

= [10.5] Symmetry breaking 



Let us first assume that r > and ask whether we can trust the Gaussian approximation. Higher non-Gaussian 
terms in the expansion of A[<p] around <p were neglected in the above treatment. The condition for this neglect is 
found in the same way as in the Bragg- Williams analysis. Namely, the neglect of the non-Gaussian term is justified 
if up 4 <C rip 2 , where ip 2 ~ l/£ d ~~ 2 is estimated from Ornstein-Zernike. This leads to the Ginzburg Criterion 

|T-T C (°>| > const u 2 '^-^ (486) 

This condition defines the border of the critical region. Within the critical region the Gaussian approximation breaks 
down because of non-Gaussian fluctuations. 






T<T„ 



T-T 



T „<T 



We now turn to the question whether the fluctuations modify T c or endanger the possibility of having a phase 
transition. Let us take a closer look at the fluctuations. Within the framework of the Gaussian approximation the 
variance of each Fourier component is 

Var(^) = 7^(pt|i7^) W 

The field amplitude ip(x) at a given point in space is the sum of M independent Fourier components. Accordingly the 
total variance is 



VarfcOc)) = 5>ar(^) = / ^^ - = g(0) 



l/L (27T) a fe 2 + (l/0 

This integral is dominate by the low frequency modes if d < 4. The Brag- Williams analysis keeps only the k = 
fluctuations, leading to the diminishing result £ 2 /L d , while the Gaussian analysis takes all the (L/£) d effective modes 
into account, leading to a the non-diminishing result l/£ d ~ 2 . 
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We see that Var(^p) diverge as T approach T c from above. Let us assume that that for T <C T c there is a symmetry 
breaking, such that (ip) jumps from +(p to —ip as h is varied across h = 0. Obviously, in order to observe a phase 
transition we have to require 

VVar(f^) < q> (489) 

It is not difficult to show that within the Gaussian approximation this self-consistency requirement leads again to the 
Ginzburg Criterion. However, this does not imply that there is actually a phase transition. We shall discuss below 
two cases in which fluctuations completely destroy the possibility to observe a phase transition. 

First we refer to the case where the order parameter has a continuous rather than a discrete symmetry. To be specific 
let us assume a complex order parameter ip = (ipi, p 2 )- In such case the potential V(<p) = (l/2)rip 2 + (l/4)</? 4 looks 
like a Mexican hat for r < 0. It means that instead of having two minima we have a continuous set of minima. This 
implies that the low frequency excitations of the system are phonon-like, called magnons in the ferromagnetic context. 
In general the excitations that appear if a continuous symmetry is broken are known as Goldstone excitations. They 
have some dispersion u> = c\k\. Coming back to the partition sum, we see that we have to include the Goldstone 
excitations in the Var((/s) calculation, leading formally to an integral over 1/fc 2 instead of l/(fc 2 + (l/£) 2 ). The 
integral is "infrared divergent" unless d > 2. We conclude that the fluctuations destroy the possibility to observe a 
phase transition at d = 2. This is know as the Mermin- Wagner theorem. 

Going back to the real field case, there are no Goldstone excitations, but we know from the analysis of the Ising model 
that fluctuations destroy the possibility to observe a phase transition if d = 1. This should be contrasted with d > 1, 
for which the critical point is shifted down (r c < 0) but remains finite (see discussion of the RG analysis) . The reason 
for the lack of phase transition here is more subtle. It is clear, even if we have a phase transition, that Var(tp) ~ (p 2 
on the critical line. The question is whether it becomes Var(tp) <C Cp 1 for any finite h away form the critical line. This 
depends on the height of the "barrier" between the two minima, and therefore cannot be deduced from the Ginzburg 
criterion: the latter is based on a local Gaussian approximation that does not know anything about the height of the 
" barrier" . We shall see in the next section that in the d = 1 case the crossover has a finite width: there is no abrupt 
change in (<p) as h — is crossed. 



[10.6] The one dimensional model 



The one-dimensional field model can be solved exactly. This is merely a variation on the "transfer matrix" method. 
The Dip integral is sliced and written as a trace over the product of J\f matrices. Each matrix can be written as 
exp(— dxH) where H is the "Hamiltonian". One realize that this is nothing else but the Feynman path integral in 
" imaginary time" . Let us define 



1 d 2 „, , 1 d 2 



2 dip 2 Kr ' 2 dip 



1 o . 1 



2 r 4' 



H = -oTTT + ^M = -o7TT+ oV + 7^ ( 49 °) 



using the notation tp = dp/dx and x = r with periodic boundary conditions over [0, L], the calculation of the partition 
function goes as follow: 

Z = f D(p e~ So i<p'+v( v )dr = trace ( e -^) = ^ e -^n (491 ) 

•I n 

where (i n are the eigenvalues of H. In the thermodynamic limit F(T, h) — LTfiQ where /xo is the ground state energy 
of H. Similarly (tp) is just the ground state expectation value. For the correlation function we get 

g(r) ex trace [c-( L - r ) ff ^c- rH J ex ^|(n|^|0)| 2 e- r ^-^ (492) 

n 

where in the last equality we already dropped the terms that vanish in the thermodynamic limit. We see that the 
long tails are characterized the correlation length £ = l/(/xi — Ho)- This correlation length does not diverge, reflecting 
that the variation of (tp) is smooth. The crossover at h = has a width that equals the tunnel splitting (/ii — jjlq). 
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The above analysis demonstrates that quite generally quantum analysis of the ground state in d dimensions is like 
classical field theory at finite temperature in d c \ = d + 1 dimensions. This implies that we can hope to observe 
"quantum phase transitions" at d > 1, notably at d = 2, because it corresponds to symmetry breaking of a continuous 
order parameter at d c i > 2. 

===== [10.7] The Heisenberg model 

The Heisenberg model relates to a three component vector field S. It is described by the Hamiltonian 

H = -J^Si-Sj (493) 

The field theory version of the Heisenberg model is the nonlinear sigma model (NLSM), where S(x) is a vector field 
that obeys the constraint IS^x)! = 1. The NLSM energy functional is commonly written as 



E[S] = ±-J\VS\ 2 dx 



(494) 



The Mermin- Wagner theorem:— In the Heisenberg model the order parameter has a continuous symmetry with 
respect to spin rotations. This is different from the Ising model where the symmetry is discrete (up/down). The 
Mermin- Wagner theorem states that continuous symmetries cannot be spontaneously broken at finite temperature 
in d < 2 systems with short-range interactions. This is due to long-range fluctuations, corresponding to massless 
" Goldstone modes" . These fluctuations cost little energy, while contribute large entropy. Hence at finite temperatures 
the T = broken symmetry is washed away. 

Let us outline the argument that explains the dimensionality issue. Assume that there is a long range order. Use the 
Gaussian approximation to calculate the correlation function around the selected minimum. The Goldstone modes are 
massless (which is like r = in the Landau-Ginzburg model). Hence g(r) = FT[l/fc 2 ]. The area under g(r) diverges 
due to the low wavelength fluctuations if d < 2, indicating that the minimum is washed away. 

===== [10.8] The XY model 

The XY model relates to a two component vector field S. It can be regarded as a 2D version of the Heisenberg model. 
The orientation of the ith spin in the XY plane is ipi. Accordingly we can write 

H = -J^cosivi-ipj) (495) 

Its the continuous field version is defined by the energy functional 

E[<p] = ^J(Vcp) 2 dx (496) 

The low lying excitations of this model are vortexes. For example, a single vortex at the origin is represented by 

ip(z) = q axg(x,y) = qtan^iy/x) (497) 

\V(p\ = (q/r) [tangential] (498) 

E[<p] = Kq\n{L/a) (499) 

where L is the radius of the system, and a is the lattice spacing. One realizes that F = Vip is like a 90deg rotated 
version of a field dues to a charge q in a 2D electrostatic problem, while E[<p] corresponds to the electroststic energy. 
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Accordingly if we have several vortexes we get 

E[<p] — const — K y^ qjQj In |r, — Tj | (500) 

which is formally the expression for the energy of a Coulumb gas in 2D. 

Note about complex representation.— Consider a field F — (F x ,F y ) in 2D that has no sources (V • F = 0) and 
no circulations (V x F = 0) . These two conditions are the Cauchy-Riemann equations that allow to represent the field 
by a complex differentiable function f{z) = F x — iF y , where z = x + iy. Note that in sloppy notations F = f(z)*. 
Note that such functions are called holomorphic and hence they are analytic, i.e. have a convergent Taylor expansion 
at any point. Note also that a holomorphic function can be regarded as a conformal map w — f(z) that induces a 
local transformation df — Cdz, where the local derivative C can be regarded as a rotation (due to its phase) combined 
with dilation (due to its modulo). From the Cauchy-Riemann no-circulation condition it follows that the field can be 
derived from a scalar potential (F = — W). Optionally from the Cauchy-Riemann no source condition it follows that 
the field can be derived from a vector potential (F = V x (0, 0, A)). We can summarize these two options by defining 
a complex potential $(z) = V(z) + iA{z) such that f(z) — —ty'(z). The lines of constant A(z) are called "stream 
lines", while the lines of constant V(z) are called "propagation fronts". Differences of A have the meaning of "flux", 
while differences of V have the meaning of "work". For a point charge in a 2D electrostatic problem the Coulomb 
field is f(z) — q/z and ^(z) — — qhi(z) corresponding to V(x,y) = — qln(r). The vortex in the XY model is just a 
rotated version of a Coulomb field with f(z) — —iq/z and ^(z) — — iq]n(z). The energy functional can be written as 

E[*] = ^J\*'(z)\*dx (501) 

The Kosterlitz-Thouless transition.— Let us see what happens if we have a collection of vortexes. The entropy 
which is associated with the number of possibility to position the vortex in the lattice is S = ln[(L/a) 2 ]. Accordingly 
the introduction of a vortex into the system implies 

F[p] - E[p}-TS[p] - (K-2T)]n[(L/a)} (502) 

Hence for T > (K/2) the creation of a vortex is favored, so we expect to have a gas of quasi-independent vortexes. For 
T < {K/2) these votexes have to pair, which is like not having vortexes. The above argumentation implies an "infinite 
order phase transition". In consistency with the Mermin- Wagner theorem it does not lead to "long range order". 
Rather it describes a transition from "conventional disordered phase" at high temperature, to "quasi-long-range 
ordered phase" at low-tcmpcraturc. The correlation function goes from exponential above the critical temperature, 
to powerlaw at the critical temperature and below. At the critical temperature g(r) = 1/r 1 ' 4 . 

= [10.9] Renormalization Group (RG) analysis 

In the field theory model, wc should perform the sum over all the field configurations ip(x) or cquivalently over all of 
the Fourier components ifk ■ In order to get a finite result we should in general use two cutoff parameters: 

L = linear size of the model (503) 

A = largest momentum scale (504) 

Consequently wc define 

F(a,r,u,h,c;A,L) = -\nZ (505) 

where the action of the Landau-Ginzburg model is 

A[ip;a,r,u,h,c] = dxl-a(Vip) + -rip 2 + -uip A — hip + c) (506) 
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where by convention the field <p is scaled such that a = 1. Note that re-scaling of ip shifts the constant c, which we 
ignore below, because it does not affect the correlation function. Below we use the symbolic notation g for any of the 
parameters {r, u, h} that define the model. 

Engineering scaling.— There is always a trivial "engineering" scaling relation 

F(s d »g,a;A/s,sL) = F(g,a;A,L) (507) 

The dimensions d r and d u and dh are fixed by the requirement d a — 0. When combined with the thermodynamic 
limit we get F{s dg g 1 a; A/s, L) = s~ d F(g, a; A, L), which we prefer to rewrite as follows: 

F(g,a;A/s) = s- d F( S - dg g,a;A) (508) 

From now on we suppress L, and assume that the volume of the system is fixed. 

Stepl of RG.— Perturbation theory allows to integrate the high Fourier components within a shell A— <5A < k < A. 
This is called "stepl" of the RG procedure, leading to 

F(g,a;A) = Fig', a'; A/s) (509) 

where s = A/ (A— <5A) = q sa / a is called the scaling parameter. Later it is convenient to use the optional parametrization 
s — e T . The elimination of the high Fourier components spoils both A and a. 

Step2 of RG.— In "stcp2" of the RG procedure the original value of A is restored using the engineering scaling 
relation. Accordingly we write 

F(g,a) = s- d F( S - d «g',a') (510) 

where both L and A are fixed and hence not indicated. 

Step3 of RG.— In "step3" of the RG procedure the field ip is re-scaled, such that the convention a = 1 is restored, 
hence one can write 

F(g) = s- d F(g s ) (511) 

where now we suppress a as well, because it is fixed. In the case of the Landau-Ginzburg model we can illustrate the 
RG flow in a (r, w, r) space. Increasing r means lower resolution description of the systems, with effective coupling g s 
where s = e T . From the perturbation theory analysis one finds a relation g s — R(s)g that maps points g to points g s . 
We have the group property R(s2)R(si) = R(s2Si), or optionally we can write R(t2)R{t\) — R(t2 + 7~i). 

RG equation.— It is convenient to re-define each parameter as dimensionless using the substitution g := A dg g. 
Consequently one obtains an equation of the type 

— = (3{g), [optional notation: dr = d In A] (512) 
dr 

Note that in the right hand side A is absorbed due to the re-definition of the units. In the Landau-Ginzburg model 
case the RG equation is 

dv 

— = 2f-3™ + 3u (513) 

dr 

dii 

— = (4-d)u-9u 2 (514) 
dr 

where f = r/A 2 and u = u/A A ~ d . Note that u of Huang is what we defined as u/4. 
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RG flow.— The RG equation defines flow in g space. We want to understand how the RG flow explains the observed 
universal critical behavior. Consequently we identify the "critical point" as the fixed point of the RG flow. In the 
Landau-Ginzburg model we have two fixed points: The Gaussian fixed point f = 0, w, = and the nontrivial fixed 
point r c = — (4— d)/6, u c = (4— d)/9. For d < 4, as expected from the Ginzburg criterion, the Gaussian fixed point is 
unstable and the flow is dominated by the nontrivial fixed point. One observes that the critical temperature (r c ) is 
shifted below the mean field Gaussian value. 

The critical exponents.— We want to understand how the RG flow explains the scaling hypothesis. In the vicinity 
of the fixed point we can linearize the RG equation. After linear transformation and shift, such that g = is the fixed 
point, we get an equation of the type dg s /dr = \g, whose solution is g s = c Xr g. Consequently we deduce that 

F(g) = s- d F( S - D <>g) (515) 

where D g = — A is called the anomalous dimension. Note that the D of Huang is defined with opposite sign (he is using 
inverse length units). Note again that "g" stands for a collection of parameters, each having its own "dimension". 
The temperature is considered to be a "relevant" parameter because we can associate with it a negative dimension 
D < 0. This means that the coarse grained system looks less and less critical as r is increased. Parameters with 
positive dimension D > vanish from the model due to the coarse graining and therefore are called "irrelevant". For 
further details see Huang. 
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Fluctuations and Response 



[11] Fluctuations 



In order to know the expectation value of an operator we need only spectral information which is present in g(E, X) 
or in Z(f3,X). Note that these functions contains only spectral information about the system (no information on 
the dynamics). Still it is enough for the calculation of the conservative force. For example, in case of a canonical 
preparation 

IT\ I dn \ X- ( dE A l d\n(Z) 

Wo = \-ax) = \^{-dx) = -p^x~ (516) 

In contrast to that, the fluctuations of F(t) — (F)o require knowledge of the dynamics, and cannot be calculated from 
the partition function. For simplicity of presentation we assume below that the fluctuating quantity of interest is 
re-defined such that (F)q = 0. 



= [H«l] The classical power spectrum 

Consider a stationary stochastic classical variable F(t), and define its correlation function as 

C(ta-ti) = (FfaMh)) (517) 

The power spectrum C(lj) is the Fourier transform of C(t), where t = t% — t\ is the time difference. In practice a 
realization of F within time interval < t' < t can be Fourier analyzed as follows: 

F u = f F(t')e iut ' dt' (518) 

Jo 

and we get the "Wiener-Khinchin theorem" 

(|F W | 2 ) = C(w)xt (519) 

where we assume that t is much larger compared with the correlation time. 

== [H«2] The quantum power spectrum 



We consider a system whose dynamics is generated by Hamiltonian %. We assume that it is prepared in a stationary 
state, possibly but not necessarily a thermal states. Our interest is in the fluctuations of an observable F. We use the 
common interaction picture notation F(t) = c FcT . The non-symmetrized correlation function of F is defined 
as 

S(t) = (F(t)F(0)) (520) 

This function is complex, but its Fourier transform is real and can be calculated as follows: 



n m 



In the case of a microcanonical preparation at some energy E, this is the same object that appears in the Fermi- 
Golden-rule (FGR) for rate of decay due to transitions to other levels, namely 

r FGR . = S E (to) x A 2 , tor H-f(t)F, with f(t) = A C - int (522) 
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See further discussion of the FGR in the Kubo formula section. In the above formula to > corresponds to absorption 
of energy (upward transitions), while u < corresponds to emission (downward transitions). 

It is a straightforward algebra to show that for a canonical preparations at temperature T, where p n oc exp(—E n /T), 
there is a detailed balance relation: 

St(-uj) = exp(-^) S T (w) (523) 

This implies that if we couple to the system another test system (e.g. a two level "thermometer") it would be driven 
by the fluctuations into a canonical state with the same temperature. We define 

C(lu) - l(5(w) + 5(-«)) (524) 

K(u) = % t (S(u) - S(-oj)) (525) 

and deduce that at thermal equilibrium they are related as follows: 

K(u>) - 4 tanh (^) G{UJ) (526) 

We shall see later that K(ui) determines the absorption coefficient of the system, hence the above relation is going 
to be the basis for a "fluctuation-dissipation relation". But first we would like to generalize the characterization of 
fluctuations to the case of having several different observables. 

= [11-3] Fluctuations of several observables 

Give several observables T J , and assuming that the system is prepared in a stationary state, the fluctuations can be 
characterized by the correlation function 

S kj (t) = (J*(t)P(0)) (527) 

The associated spectral function S i(u>) is defined by a Fourier transform. For simplicity we use the notations J-" 1 = A 
and J- 2 = B, and write the spectral decomposition 

~S AB {w) = Y,Pn E A nm B mn 2nsL- Em ~ E A (528) 

It is convenient to write S AB (t) as the sum of two real spectral functions that have a good classical limit: 

S AB (t) = C AB (t)-i^K AB (t) (529) 

C AB (t) = ±(A(t)B(0) + B(0)A(t)) (530) 

K AB (t) = l -([A(t),B(0)}) (531) 

We use the notations S AB (u>), and C ab (oj), and K ab (uj) for their Fourier transforms. With regard to the spectral 
decomposition of C AB (ui) and K AB (u>) we note that it is convenient to write p n = f(E n ). We can simplify these 
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expressions by interchanging the dummy indexes n,m in the second term. Thus we get 

C AB (") = \ ^2(f( E n) + f(E m )) A nm B mn 2n5 L - E "^ E A (532) 

n,m ^ ' 

K (w) = iuj 2_^ p - p A nm B mn 2tt5\uj I (533) 



t-I rp Ibllb libit "" I — J- 

n.m m " v 



Note that for a canonical state f(E n )—f(E m ) = ta,nh((E n ~E m )/(2T)) x (f(E n )+f(E m )). Note also that in the 
expression for K AB (ui), the u> cancels the E m —E n denominator. The reason for pulling u> out of the sum is to 
emphasize the low frequency dependence. 

= [11.4] Reciprocity relations 



There are some reciprocity relations that should be noticed. First we note that by definition S AB (£) = [S BA (— £)] . 
In practice it is more illuminating to write the FTed version of this reciprocity relation, which can be directly deduced 
by inspection of the spectral decomposition. Namely, 

S ab (lj) = [S BA (u)Y (534) 

It follows that C ab (uj) = [C ba (uj)]* , while K ab (uj) = -[K BA (co)]* . There is a second reciprocity relation that 
follows from time reversal invariace. Also here it is simpler to look on the spectral decomposition and to remember 
that [j4 n , m ]* = -A„* iTO *, where n* and m* are the eigenstates of the time reversed Hamiltonian. In practical terms it 
means that one has to reverse the sign of the magnetic field h. Consequently [S ba (oj; h)] — [±]S ba (uj; —h), where 
the plus (minus) applies if the signs of A and B transform (not) in the same way under time reversal. Combining 
with the trivial reciprocity relation we get the Onsager reciprocity relation 

S AB {io-h) = [±}S BA (u;-h) (535) 

The Kubo formula that we discuss in the next section implies that the same reciprocity relations hold for the response 
kernel ot k \ to the susceptibility x k ^ and to the DC conductance G ■ 7 . These are called Onsager reciprocity relations 

= [H.5] Detailed balance 



We can also generalize what we called " detailed balance relation" . In the quantum context this is a relation between 
"positive" and "negative" frequencies. Assuming that the system is prepared in a canonical state we have 

S AB {-u) = ^v(-y) St» (536) 

From here it follows that 

Kf: B (uj) = i — tanh I — ) C AB (uj) [quantum canonical version] (537) 

The Kubo formula that we discuss in the next section is expressed using K (t). But it is more convenient to 
use C AB (t). The above relation between the two is the basis for the FD relation and its generalizations. 

It is interesting to look on the classical limit of the generalized detailed balance relation, which can be obtained by 
averaging of the classical microcanonical version (a clue with regard to the derivation of the latter will be presented 
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later when we discuss the classical fluctuation-dissipation relation): 

Kf: B (uj) = iu x — Gp B (cj) [classical canonical version] (538) 



*"(") - ^wm 



g{E) Cg (ui) [classical microcanonical version] (539) 



Optionally the classical canonical version can be regarded as the low frequency limit of the quantum relation. It looks 
very nice in time domain. Omitting the superscript and using dot for time derivative it reads 

K T (t) = -I C T (t) (540) 

We shall use the latter version in order to derive a generalized fluctuation-dissipation relation for the DC conductance. 
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[12] Linear response theory 

= [12.1] The notion of linear response 



Let us assume that X(i) is an input signal, while F(t) is the output signal of sonic black box. Linear response means 
that the two are related by 

/oo 
a{t - t') X(t') dt' (541) 

-oo 

The response kernel a(t — t ) can be interpreted as the output signal that follows a 5(t — t ) input signal. We assume 
a causal relationship, meaning that a(r) — for r < 0. The linear relation above can be written in terms of Fourier 
components as: 

F u = x(w)X w (542) 

where x( w ) is called the generalized susceptibility. Because of causality x(cj) is analytic in the upper complex plane. 
Consequently its real and imaginary parts are inter-related by the Hilbert transform: 

ROIXM] = f*"^ (543) 

J -co W' - W 7T 

(the reverse Hilbert transform goes with an opposite sign). The imaginary part of x( w ) is the sine transforms of a(r), 
and therefore it is proportional to u> for small frequencies. Consequently it is useful to define 

XoM = Re[x(w)] = / a(r) cos (wr)dr (544) 

Jo 

, . Im[x(w)l f°° . .sin(wr) , ._... 

r\(u) = — L v n = / a(r) — - — -dr (545) 

w J ' w 

and write 

F w = X o(u)X u -r]{w)X w (546) 

For low frequency "DC driving" we can regard %o & n d r\ as constants. Note that 

/■OO 

»j [DC limit] = / a(r) rdr (547) 

Jo 



^=^ [12.2] Rate of energy absorption 

We first consider the case of one parameter driving, where T is the generalized force conjugate to X. The postulated 
LRT relation regards X(t) as a small deviation from some reference value Xq = 0, while F(i) is the induced deviation 
from the equilibrium value. Accordingly F u = xqX^ — rjX^ implies 

(F) t = (F)o +Xo (X-X Q ) -rjX = (T) x -r]X (548) 

The in-phase response gives the conservative effect, and in the second equality it has been absorbed into the first 
term. The out-of-phase response gives the dissipation. The dissipative term is responsible to the irreversible work as 
discussed in the "Work" section. The rate of dissipation is W = r\X 2 . 
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The above considerations regarding dissipation can be generalized to a source that has an arbitrary power spectrum 
("AC driving"). The irreversible work equals the integral over r)(uj)X 2 . For a stationary driving source X 2 is 
proportional to the measurement time. So we can write 

/OO 7 

»7(w) 5 source (w) — = vac x RMS[A] 2 (549) 

Possibly it is more transparent to consider a pure AC source that has a definite frequency f2. In such a case we write 

X(t) = ReL4e- im ] (550) 

(7) t = Re[ X (n) Ae~ iQt ] = xofflX - v {il)X (551) 

W = (-XT)t = r?(0) x {l/2)[Afl] 2 , [averaged over cycle] (552) 

Note again that only the out-of-phase response gives dissipation, and that Ail/^/2 is the RMS value of sinusoidal 
driving. 

= [12.3] LRT with several variables 

Commonly the Hamiltonian H(r,p; Xi, X2, X3) depends on several control parameters. Then we can define gener- 
alized forces in the usual way: 

** - -55 (553 > 

It is implicit above that the Xj represent some small deviation form some initial value X(0) = Xq = 0. Consequently 
the postulated linear response relation is written as 

/oo 
a kj {t-t')Xj{t')dt' (554) 

j -°° 

The low frequency limit of the linear relation between the generalized forces and the rate of the driving can be regarded 
as a generalized Ohm law that includes an additional "geometric" term. Disregarding the conservative contribution 
and changing notation for the dissipation coefficient the one parameter version (J 7 ) = — GX is generalized as follows: 

(T k ) = - Y, G kj Xj = r]X -BAX (555) 

j 

where r) k i and B k ^ are the symmetric and anti-symmetric parts of G kj . The rate of dissipation is given by 

VV = -J2(^ k )X k = ^rjkjXkXj (556) 

k k,j 

Note that only J7 fcj contributes to the dissipation. 

^^^ [12.4] The Kubo formula 



The Kubo formula is an expression for the response kernel that relates the expectation value (A) t of some observable A 
to driving field /(£), where the driving term in the Hamiltonian —f(t)B involves the operator B. 



a Aa {t) = Q(t) (- A(t),B ) = Q(t)K AB (t) [Kubo formula] (557) 
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The formula has a good classical limit, and has various derivations. See "Lecture notes in quantum mechanics". The 
DC value of the dissipation coefficient is obtained by integration: 

/>oo 

V AB [DC limit] = / K ab (t) rdr (558) 

Jo 

More generally, an expression for the generalized susceptibility follows from the convolution theorem: 

Of particular interest is the case where A = B = T is a generalized force that is conjugate to the variation of some 
parameter X such that /(£) = (X(t) — Xq). This is the case of interest in the study of friction (where / is the 
displacement velocity) and in the study of electrical conductance (where / is the electromotive field). From the 
definition it follows that K(—t) = —K(t), hence K(u>) is pure imaginary, and consequently the friction coefficient is 

rj(uj) = ■ = — — K(u>) [Kubo formula for the dissipation coefficient] (560) 

So far we have used versions of the Kubo formula that are " good" both classically and quantum mechanically. In the 
quantum case one can write a version of this formula that involves the non-symmetrized corresltion function: 

f](uj) = -—- [S(lj) — S(— oj)] [Quantum FGR version of the Kubo formula] (561) 

Zuuj 

This expression can be deduced directly from the FGR picture as follows. Assume that "H driving = —f(t)V with 
/(£) = Asm(flt). From the FGR it follows that the rate of energy absorption due to upward transitions is 
(A/2) 2 S(ii)fl. Similarly the rate of energy emission is (A/2) 2 S(— £1)0. The net rate of heating is the difference. 
By definition it is written as W — T](fl)[f 2 }, where [f 2 ] = (l/2)[Att\ 2 . Hence one deduce the above expression for i]. 

Below we discuss the non-trivial generalization of the Kubo linear response formalism for the case of Hamiltonian 
that depends on several parameters. We start with the dissipation-less quantum adiabatic limit, and continue with 
the full linear response analysis. 

= [12.5] Adiabatic response 

For an extended presentation see "Lecture notes in quantum mechanics". Given an Hamiltonian "H(r,p; Xi, X2, X%) 
that depends on several control parameters, we find the adiabatic basis \n(X)) with eigenenergies E n (X). Then, for 
a given level, we define in parameter space the "Christoffcl symbols" and the associated "curvature field" as follows: 

<} m{X)\ (562) 



Km = ih(n(X) 



dX j 



B% = diAi-fyAi (563) 

We use the notation A J n = A 3 nn , and note the following identities: 

Km = j^ 1 [n^m] (564) 



x -. 2 /am [TL^L ] 

(E m — E n ) 



B n = Y, ^ "" \T ( 565 ) 



m(j^n) 
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If we have 3 control variables it is convenient to use notations suggesting that we can formally regard A n as a vector 
potential whose rotor B n is formally like a magnetic field: 

X .— ► (X 1 ,X 2 ,X 3 ) (566) 

A —► {A\ nl Al n ,Al n ) (567) 

B .— ► (B 23 ,B 31 ,B 12 ) (568) 

With the above definitions we can cast the Schrodingcr equation into a very convenient form. This is done as follows. 
We write 

||V>) = ~lnX(t)) |V) (569) 

We expand the state in the adiabatic basis as 

\4>) = ^>„(i) \n{X{t))) (570) 

n 

and get the equation 

— f = --(E n -X ■ A n )a„ - - V] W nm a m (571) 

at n h *—' 

m 

where 

W nm = — 2_, XjA 3 nm for n^m, else zero (572) 

3 

In the adiabatic limit the first order response of a system that has been prepared in the nth adiabatic state is 

(T k ) = -Y,B k n j Xj = BAX (573) 

3 

We shall explain in the next section that this corresponds to the geometric part of the response in the Kubo formula. 
The Kubo formula contains an additional non-adiabatic (dissipative) term that reflects FGR transitions between 

levels. 

^^^^= [12.6] Low frequency response 

Here we go beyond adiabatic response and discuss both the adiabatic and dissipative terms that are implied by the 
Kubo formula. Recall that the Kubo expression for the response kernel is a k: >(T) — O(t) K k: >(T), whose Fourier 
transform is the generalized susceptibility: 

./-co u> - lj' + i0 2n 
Taking into account that Ke[x k ^ (oj)} is symmetric with respect to u> we have 

Qkj = Um Im[x fcj (^)] = lim dk = r Kkj{T (575) 
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The last expression (in time domain) is mentioned for completeness. In practice it is more convenient to proceed in 
frequency domain. After some straightforward algebra we get 

c« = i lini M^H]_r^Re[^M] s nkj + Bkj (576) 



2 w'-^b lu J_ cc 2ir up- 



We notice that rj i is a symmetric matrix while .B fej ' is anti-symmetric. Hence in abstract notation the linear-response 
relation can be written as a generalized Ohm law: 

{F k ) = - Y, G kj Xj = r]X -Bf\X (577) 

3 

This is a generalization of the adiabatic response formula. The additional term takes into account the FGR non- 
adiabatic transitions between levels. To see clearly the connection we substitute the spectral decomposition of K i (ui) 
and get the following expressions: 

v*m = P iRkj Ml <u = y „ y ( z3h&, H m ^L \ (578) 

J_ oo uj-w' + i0 2tt ^ Fn ^\huj-(E m -E n )+iO huj+{E m -E n )+iOj v ; 



n m 



and 

_ i M^M . . rtE /( g )-/( g „) ^ ^ 

2 w^o a; ^^ £/„ — ii m 

ra,m 

./_„, 2tt cj 2 ^ (£ m - E n ) 2 

The Re[] and Im[] are not required because the doubles summation cares for that. To expression for B J can be 
written in a way that better emphasize the relation to the analysis of the adiabatic response: 

*" = £/(£„) £ ^^H = £/(^)^ (581) 
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[13] The fluctuation dissipation relation 
= [13.1] General formulation 



The essence of the fluctuation dissipation (FD) relation is to relate the response of a system to its fluctuations in 
equilibrium. In order to derive this relation we have to supply information on the preparation of the system, which 
is typically assumed to be canonical. In the classical context there is a useful microcanonical version from which 
the canonical version can be derived. The standard formal derivation of the FD relation is based on the generalized 
detailed balance relation that allows to express K k i{uj) using C k: >(uj). There are two useful cases: DC version for any 
number of variables, and AC version for one parameter driving. In the DC case we know from Kubo that G is an 
integral over tK(t), and from the detailed balance relation that K(t) can be obtained from the derivative of C{t). 
After integration by parts we get 

1 r°° 

G AB = i / C AB (r) dr (582) 

2 Jo 

We turn now the the one-parameter driving AC version. Here we realize that the imaginary part of the susceptibility 
is determined by K(u), and hence from the detailed balance relation it follows that 

MXM] - ^tanhf^W) (583) 



uj Huj \2T J 

Both versions, the DC and the AC are consistent and imply that the low frequency dissipation coefficient is relate to 
the equilibrium intensity of the fluctuations: 

n = g, VT ~ T {:F{T):FmT dT (584) 

Note that in the above writing we assume that the equilibrium value of the fluctuating force is (J 7 ) = 0, else T should 
be re-defined so as to have a zero average. 



[13.2] The diffusion-dissipation picture 



We can illuminate the physics of FD for DC driving using a simple diffusion-dissipation picture. We show below that 
the DC energy absorption rate is related to the induced diffusion in energy space. To simplify the presentation we 
use a classical language. We can deduce that the driving induce diffusion in energy space from the relation 

E{t) - E(0) = -X [ F{t')dt' (585) 

Jo 

leading to 

((E(t) - E(0)) 2 ) = X 2 f f (T{t')T(t"))dt'dt" (586) 

Ja Jo 

where the averaging assumes a microcanonical preparation. Thus we get 

8E 2 (t) = 2D E t (587) 

where the leading order estimate for the diffusion is 

1 • f°° 1 • 

D E = -X 2 (T(r)T(0)) E dT = -v E X 2 (588) 
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On long times we assume that the probability distribution p(E) = g(E)f(E) of the energy satisfies the following 
diffusion equation: 

The energy of the system is (H) = J Ep(E)dE. Taking the time derivative and integrating by parts, it follows that 
the rate of energy absorption is 



w = *< w >—y rf£g(s) ^^lgMJ (590) 

For a microcanonical preparation p(£ l ) = 5(E — £). Substitution and integration by parts leads to 

™ = i^ = m^ [g{E) De] 1=s (591) 

By definition W = rjX 2 and De — (\/2)vX 2 . Consequently the diffusion-dissipation relation reduces immediately to 
the microcanonical version of the fluctuation-dissipation relation: 

The canonical version r\ = i>t/{2T) can be derived from the integral expression for W, upon the substitution 
p(E) = (l/Z)g(E)e~" E . Optionally it can be obtained from the microcanonical version by canonically averaging 
over E and performing integration by parts. 



[13.3] The wall formula 



The first prototype application of the FD relation is to the calculation of the friction in the Brownian motion problem. 
Consider a gas of particles in a box. The system is driven by moving in it a "spoon", or a "piston" or a "Brownian 
body". 

• The parameter in "H(X) represents the position of the spoon. 

• The generalized force is the Newtonian force (J 7 ) on the spoon. 

• The DC linear response relation (J 7 ) = — r/X describes friction. 



• 



The dissipation rate is W = r]X 



2 



Our purpose below is to find an expression for the friction coefficient r\ using the FD relation. For this purpose we 
have to calculate the intensity vt of the fluctuations of ~F at equilibrium, and to use the relation r\ = ut/(2T). 

Due to random collisions of the gas particles, the Brownian body experiences a "random force" that can be written 
as the sum of short impulses: 

F(t) = ^2mvj 5{t-tj) (593) 

3 

Here tj is the time of the jth collision with velocity Vj at the x direction. Note that |t>j| ~ vt, where vt = (2T/m) 1//2 
is the thermal velocity. The rate of collision for TV particles is 

I- Wx (*)x5i <m 
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where L 3 is the volume of the box that holds the gas particles, and A is the effective area of the moving wall. 
Accordingly the intensity of fluctuations is 

v T = C(uj = 0) = —(mv T ) 2 = m 2 u| — A (595) 

T L 6 

and for the friction we get 

V = ^C(w = 0) = pv T xA (596) 

where p — (N/L 3 )m is the mass density of the gas particles. 

We note that if the dynamics of the Brownian body is described by a Langevin equation, then v /rj = 2T implies that a 
canonical equilibrium is reached. For more details see the Langevin section. This was in fact the historical deduction 
of the FD relation by Einstein in the context of Brownian motion study. If the Brownian particle is moving in an 
incompressible fluid the above result docs not apply. Instead the friction is given by Stokes Law, and we can use the 
FD relation "in reverse" in order to deduce the intensity of fluctuations. 

= [13.4] The Drude formula 

The second prototype application of the FD relation is to the calculation of electrical conductance. Here we show 
how to derive the Drude formula for a gas of classical particles in an EMF driven ring. For an extended discussion of 
electrical conductance see the "additional topics" section of the lecture notes. 

We consider a ring driven by an electro- motive force (EMF). The interaction term originates from the kinetic term 
[p — (e/X)<I>(t)] 2 /(2m), where $ is the flux and / = (e/L)v is the conjugate current. Optionally we can say the the 
interaction is — A(t)v, where A(t) is the vector potential and the velocity v is the conjugate variable. Summarizing: 

• The parameter in "H(<I>) represents the magnetic flux. 

• The generalized force is the current (X) in the ring. 

• The rate in which the flux is varied determines the EMF = — $ by Faraday law. 

• The DC linear response relation (X) = G x EMF is Ohm law. 

• The dissipation rate W = G$ 2 describes Joule heating. 

Our purpose below is to find an expression for the conductance G using the FD relation. For this purpose, following 
Drude, we postulate what is the velocity-velocity correlation function; calculate the intensity vt of the fluctuations 
of X at equilibrium, and use the relation G = vt/{^T). 

Following Drude we assume an exponential velocity- velocity correlation function with a time constant tq, such that 
the mean free path is £ — Vtto- Hence we deduce that C vv (u>) is a Lorentzian, and in particular 

D = \c vv {0) = v 2 t t (597) 

where vt — (2T/m) is the thermal velocity. From here we get an expression for the intensity of the current fluctuations 

v T = C(co = 0) = (^) C„„(0) (598) 

and for the conductance of A^ particles we get 
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where A is the cross section of the ring. Optionally the Drude expression can be written as 

G™ = e»(^U - £a4 (600) 

\mvLJ L 2tt L v ' 

where M. corresponds to the effective number of open modes. There is a very simple toy model for which the 
"exponential" velocity- velocity correlation can be deduced, and hence £/L can be evaluated analytically. Consider a 
ring with a single stochastic scatterer that has a transmission g. The current-current correlation function at given 
energy is 

C(t) = e 2 — 2_, (2.9 — 1)™ S I t — I — n ) ) [exponential decay of correlations] (601) 

n— — co ^ ^ ' ' 



From the integral it follows that 

L ' \-g 

Assuming that g is independent of energy the result applies to any occupation. 



(602) 



Conductor in electric field.— A straightforward generalization of the above applies for an extended piece of metal 
that in placed in a time dependent electric held. The electric held is described by a vector potential such that £ = —A. 
The interaction term in the Hamiltonian is an extended version of the simplified — A(t)v that we have assumed in 
previous discussion: 

H lnt = - [ J(x) ■ A{x) d 3 x (603) 



In linear response theory the current is proportional to the rate in which the parameters are being changed in time. 
Regarding the values of A at different points in space as independent parameters the postulated linear response 
relation takes the form 

(J(x)) = j cr(x,x r ) £{x') d 3 x (604) 

where (Tij(x,x') is called the conductivity matrix. The FD relation states that the conductivity is proportional to 
the temporal FT of (Ji(x,t)Jj(x',t')) with respect to the time difference r = t — t' . The proportionality constant is 
1/(2T) in the DC limit. 

Nyquist Noise.— The FD relation in the electrical context is known as Nyquist theorem. It can be used "in reverse" 
in order to deduce the Nyquist noise v = 2GT, provided G is known from experiment. It should be clear that in 
non-equilibrium conditions we might have extra fluctuations, which in this example are known as shot noise. 

= [13.5] Forced oscillator and forced particle 

Consider a particle that is (say) bounded to a spring. This means harmonic oscillator that is characterized by frequency 
f2. The interaction term is — £{t)x. Accordingly, 

• The parameter in %{£) represents an electric held. 

• The generalized force is the polarization (x) of the particle. 

• The AC linear response relation is (x) = x( w ) £■ 

The FD relation implies that x has fluctuations at equilibrium, that are related to the imaginary part of the suscep- 
tibility: 



hu> 



2T 



1 



C xx (uj) = ftcoth — - Im x(w) , xM = 5 : ■ ^ (605) 



irjuj + mil 2 
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Note that the fluctuations of the velocity are C vv (oj) — uj 2 C xx (uj). From C xx (0) and C vv (0) one can deduce the 
average energy of the oscillator and verify that it is in agreement with the canonical ensemble. 

Forced particle.— The limit il — > of harmonic oscillator corresponds formally to a Brownian particle. Accordingly 
in the classical limit we get a Lorentzian 

<%» = (2T/m) x 2 { "> /m) (606) 

indicating a Drude type (exponential) velocity-velocity correlation function. The integral over the velocity-velocity 
correlation function determined the diffusion coefficient, namely 

D = \c vv {Q) = - (607) 

There is an optional shortcut in the application of the FD relation in the context of forced particle. Let us write the 
electric field as £ = —A. The interaction term is — A(t)v. Accordingly, 

• The parameter in H{A) represents the vector potential. 

• The generalized force is the velocity (v) of the particle. 

• The DC linear response relation (v) = /i £ describes drift motion. 

• The dissipation rate W = ji£ 2 describes Joule heating. 

The FD relation in this notations implies that v has fluctuations at equilibrium, that are related to the mobility 
/i = I/77. The "intensity" of the velocity fluctuations is 2D. Hence the classical FD relation implies that the ratio of 
the diffusion (D) to the mobility (/1) equals the temperature (T). This is known as Einstein relation. For more details 
see the Langevin section. 



[13.6] The fluctuations of an Ohmic system 



Sometimes it is convenient to characterize the system by its response, and from this to deduce the power spectrum of 
the fluctuations. So we regard K(u>) as the input and write 

C{u) = Jcoth^)lm[ivH] (608) 

The so called Ohmic power spectrum is 

i^hm.cH = i2r,u (610) 

C ohmlc M = v hio coth(^j (611) 

&—M = qfr" i_ c -iwr (612) 

for which the friction coefficient is r\ — const, up to some implicit high frequency cutoff w c , and the intensity of 
fluctuations is v = 2i]T. 



[13.7] Many body fluctuations of Fermions 



It is interesting to see how an Ohmic behavior is deduced for a low temperature system of Fermions. We consider the 
fluctuations of a general observable A. If we treat the many body system as a whole then we have to employ second 
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quantization to write A = 'Yl imn ^mn<An a n- Excluding the irrelevant diagonal n=m terms we get for a non-interacting 
system in a thermal state 



§W( U ) = FT^A(t)A(0)^ (613) 

= ^ \A mn \ 2 (aia m atn a n) 2ir6(co-(E m -E n )) 

nrn 

= J2(l-f(E m ))f(E n )\A mn \ 2 2Tr6(u-(E m -E n )) 

nm 

W/A C EF ( U ) 



1 - e^' T 



where the last expression is obtained if the single particle power spectrum is energy independent in the range of 
interest around the Fermi energy. Note that, at zero temperature, the first factor becomes (w/A)0(w), cutting off 
all contributions at negative frequencies. Physically, this represents the fact that a zero-temperature fermion system 
can only absorb energy. If it couples to a zero-temperature environment, there will not be any transitions at all, in 
contrast to what would be deduced from the single-particle spectrum alone. 

=== [13.8] The FD relation for a system of Fermions 



For one particle we can define the single particle microcanonical fluctuation spectrum Ce(w), and the canonical 
fluctuation spectrum Ct{u). If we have many body occupation by N Fermions, we can express the additive K(us) 
using the one-particle correlation functions (The commutator is additive and does not care about the Pauli exclusion 
principle) : 

f>[N]/ \ _ ■ I &(Ef) Ce p {u) degenerated Fermi occupation (k\a\ 

1 (N/T) Ct{u>) dilute Boltzman occupation 

Once we know K^ (u) we can calculate the dissipation coefficient using the Kubo formula: 

v W(u) = ^-K^{u) (615) 

The fluctuations of the many body current are deduced from the canonical FD relation: 



H , /fiw\ 



2 V 277 



hto , / hui 



&* M = '-: coth ( Z ) im | &m m | = Z coth ( Z) { (T/ ^c% {Uj) ( 616 ) 



2T \2T 



Note that for the non-symmetryized power spectrum S^ N ' (to) we have the same expression with 

cothf^ ^ ^ (617) 

It is important to observe that the intensity v of the current fluctuations is not simply the one-particle v times the 
number of particles. Rather, for low temperature Fermi occupation, the effective number of particles that contribute 
to the noise is T/A where A is the mean level spacing. Needless to say that the result for 5^1 (to) that has been 
deduced here from the FD relation is consistent with the direct calculation utilizing Fock space formalism of the 
previous section. 
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= [13.9] The fluctuations of the potential in metals 

The dielectric constant of a metal is defined via the linear relation between the total electrostatic potential t/ total and 
an external test charge density /9 oxt 



U total 



1 



e(9, 



4ire z 
„2 ) ^ ext 



(618) 



For simplicity we relate here and below to one component q of the fields. The total electrostatic potential is the sum 
of the external potential L/ cxt = (47re 2 /g 2 )p oxt , and the induced potential U alct = {Aire 2 /q 2 )p B i ct , where p elct is the total 
density of the electrons. The dielectric constant can be deduced from the equations of motion dp alct /dt = — V J with 
J = — (cr/e 2 )VC/ total — DVp clct that leads to the relation 



(a/e 2 )q 2 



A'olct 


7—. n *- / total 

iu) — JJq z 




hence to C/ tot , 


a = (l/s)U ext , where 




e(g»w) = 1- 


47rcr 




iu — Dq 2 




3 that 






Im 


-1 

_e{q,u)_ 


Attctuj 


UJ 


(Dq 2 +47rcr) 2 +uj 2 


Aira 



(619) 



(620) 



(621) 



The interaction between the electrons and an external electrostatic field is described by "H ox , = U oxt p alct which can be 
also written as H txt — p oxt C/ olct . The fluctuation dissipation relation expresses S^ N ' (q, us) using the response function 
a(q,u>) that relates C/ olct to — p oxt which is 



a(q,w) = 

ly. 


47re 2 
q 2 


1- 


1 


e(q,uj)_ 



(622) 



S [N] (q,Lu)=lm\a(q,Lu) 



1 - c-"/ T 



(623) 



leading to 



o- 9 2 V 1 



2w 



3 -w/T 



(624) 



The Ohmic behavior is cut-off by \u>\ < l/r c and |q| < 1/^ where t — v f t c is the elastic mean free path, and w F is the 
Fermi velocity. Recalling the Einstein relation a = e 2 vD 1 where v = A -1 / L d is the density of states per unit volume, 
we can write this result more conveniently as follows: 



S [N \q^) 



1 



vDq 2 \\ 



'2uj 



3 -u/T 



(625) 



Note that the electron charge e cancels out from this final result for the Nyquist noise spectrum. This well-known 
fact is due to the effects of screening: A larger value of the charge would be canceled by a correspondingly stronger 
suppression of density fluctuations. 
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System interacting with a bath 
[14] The modeling Hamiltonian 

= [14.1] The Born-Oppenheimer Hamiltonian 



We first discuss system that is coupled to some other degrees of freedom that can be eliminated using an adiabatic 
scheme. This leads to the Born-Oppenheimer picture. It is strongly related to Linear response theory and the 
presentation below is arranged accordingly. 

Linear response theory is the leading formalism to deal with driven systems. Such systems are described by a 
Hamiltonian 

H=U{Q,P;X(t)) (626) 

where (Q,P) is a set of canonical coordinates (in case that the Hamiltonian is the outcome of "quantization"), and 
X(t) is a set of time dependent classical parameters ("fields"). For example, X can be the position of a piston. In 
such case X is its velocity. More interesting is the case where X is the magnetic flux through a ring. In such a case 
X is the electro motive force. The Kubo formula allows the calculation of the response coefficients. In the mentioned 
examples these are the "friction coefficient" and the "conductance of the ring" respectively 

In the limit of a very slow time variation (small X), linear response theory coincides with the "adiabatic picture". In 
this limit the response of the system can be described as a non-dissipative "geometric magnetism" effect (this term was 
coined by Berry and Robbins). If we increase X beyond a certain threshold, then we get Fermi-goldcn-rule transitions 
between levels, leading to absorption of energy ("dissipation"). Then linear response theory can be regarded as a 
generalization of "Ohm law" . 

The Born-Oppenheimer picture allows to deal with Hamiltonians of the type 

«tot.i - n (x,p) + H(Q,P;x) (627) 

Here we replaced the parameter X{t) by a dynamical variable x. The standard textbook example is the study of 
diatomic molecules. In such case x is the distance between the nuclei. It is evident that the theory of driven systems 
is a special limit of this problem, which is obtained if we treat x as a classical variable. For presentation purpose let 
us consider the Hamiltonian 

w total - ^mY<$ + n (Q> p -> x ) ( 628 ) 

3 

We define the basis \x, n(x)) — \x) (g> \n(x)) 1 and expand the state as 

I*) = E**(*) MO")) ( 62Q ) 

Using 

(x,n(x)\ / H\xo,m(xo)) = 8(x-xo) x 6 nm E n (x) (630) 

(x,n(x)\pj\xo,m(xo)) = {-idjS(x-xo)) x (n(x)\m(xo)) = -idjd(x-xo)5 nm - 8(x-xo)A J nm (x) (631) 

we deduce that pj i->- — idj — A 3 nm (x), and the Hamiltonian can be written as 

W total = JL^fo-^a:)) 2 + E(x) (632) 

3 
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The adiabatic approximation is obtained if one neglects the n 7^ m terms that couple the motion on different energy 
surfaces. These couplings are responsible to the dissipation effect. 



= [14.2] The bath model Hamiltonian 

The Hamiltonian of a system that interact with and environment is conveniently arranged as 

Wtotai = Ho{x,p) + H{Q,P;x) 
For an interaction with a general (possibly chaotic) environment we write 



(633) 



n, 



Ho(x,p) + xB 



(634) 



where E = {E n } is the bath Hamiltonian that can be written is some diagonal representation, while B = {B nm } 
represents that interaction term with x. However, it is more conventional to consider an interaction with an harmonic 
bath: 



H (x,p) + U{x,Q a ) + H blltll (Q a ,P a ) 



(635) 



where (say) 



H (x,p) 



f 



p 2 + V(x) 

p a 2 , 1 

^\2m Q 2 



^ ' ^- - >a«SQa 2 



(636) 
(637) 



and the interaction is (say) 
^zcl — x y ^ c a i^ a 

a 

Wdld = / CaWaUyX X a j 



(638) 
(639) 



where the DLD possibility is appropriate for the motion of a particle in an extended environment. 




dissipation 



fluctuations 




driving source 

"slow" DoF 

"system" 



driven systei 

"fast" DoF 

"environmen 
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[14.3] Harmonic bath 



It is common to model the environment as a huge collection of harmonic oscillators, and to say that the system if 
subject to the fluctuations of a field variable J- which is a linear combination of the bath coordinates: 



T 



/ C a (^/ a — > C Q 



1 



1/2 



(a Q + aJJ 



2m Q cjQ, 
For preparation of the bath in state n = {n Q } we get 

5 M = ^^c* |<n Q ±l|Q a |n a >| 2 27r<J(wTWa 



(640) 



(641) 



a ± 



Using 



(n a +l|Qa|n a ) = 
(n a -l\Q a \n a ) = 



1/2 



\/l + n a 



1/2 



we get 



s(w) = E 



-2ttc" 



(l+n a )(5(w — u a ) + n a S(uj + to a ) 



(642) 
(643) 

(644) 



For a canonical preparation of the bath we have (n a ) — f(u) a ) = l/(c w / T — 1). It follows that 
SM = 2J( M )x|^+^ = 2J( W ) r -^ E7 



(645) 
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where we used /(— to) = — (1 + f(u>)), and denned the spectral density of the bath as 
z * — ' m Q u; Q 

a 

with anti-symmetric continuation. For an Ohmic bath J(u>) — rjco, with some cutoff frequency lo c . 



(646) 



===== [14.4] Spin bath 

We define T 

a a 

Thus Q a is the first Pauli matrix. Its non-trivial matrix elements are 

(n a -l\Q a \n a ) = y/n^ 
(n a +l\Q a \n a ) = \f\ -n a 

In complete analogy we get 

S(u>) = 2_. ^^Ca (l—n a )S(u> — u> a ) + n a S(uj + U! c 

a 

For canonical preparation (n a ) — f{uj a ) where (from here on h = 1) 

1 



/M = 



Thus we get 



e^ + 1 



1 



I + c-0' 



t=i-/M 



S(w) = 2J(M)xJ (1 fW = 2J(w)- ^ 

and 

C(w) = J(w) 
where we define 

J(w) = 7r^c^(w-w a ) 

a: 

with symmetric continuation. For Ohmic bath J(w) = v, with some cutoff frequency w c 



(647) 



(648) 
(649) 



(650) 



(651) 
(652) 



(653) 



(654) 



(655) 



[14.5] Spatially extended environment 



In this section we describe fluctuations of an extended environment in space and time using the form factor S(q,co). 
We define 



S(q,u) = FT (U(x 2 ,t 2 )U(x ll t 1 )) 



(656) 



where the expectation value assumes that the bath is in a stationary state of its unperturbed Hamiltonian. The force- 
force correlation function is obtained via differentiation. In particular the local power spectrum of the fluctuating 
force is 



S ( u ) = / 7r9 2 S(q,u) 

Ztt 



and the intensity of the fluctuations at a given point in space is 



(657) 



v = S(lu=0) 



dq 2 



2tt 



q A S(q,LO=Q) 



(658) 



For the one dimensional DLD bath we get 



U 



/ C-^Kc^ o,Vj\X X a ) 



(659) 



Taking into account that the oscillators are independent of each other we get 

(U(x 2 ,t 2 )U(x 1 ,t 1 )) = y^^c 2 a (Q a (t 2 )Q a (ti))u(x 2 -x a )u(x 1 -x a ) 



= dx 



^ c 2 a (Q a (t 2 )Q a (ti))6(x - x a ) 

u(x2— x)u(xi— x) dx S(t 2 — t\) 
w(x 2 - x\) S{t 2 - ti) 



u(x 2 —x)u(xi—x) 



(660) 

(661) 

(662) 
(663) 



Where we have assumed homogeneous distribution of the oscillators, and S(t) is defined implicitly by the above 
equality With the convention w"(0) = —1 it is identified as the local force- force correlation function. Consequently 
we get for the form factor 



S(q,u) = FT (W(a; 2 ,t 2 )W(a; 1 ,t 1 )) = w(q) S(w) 



As an example we may consider the following correlation function: 



(664) 



w(r) 



' exp 



I /r\2 



2 V^ 



(665) 



If the spatial correlation distance is very large we get ZCL model: 

1 2 

w(r) — const r 

' ' 2 

leading to 



(666) 



S(q,w) 



4s(q) S(co) 



(667) 



This means that the force is homogeneous in space, and fluctuates only in time, which is effectively the case if a 
particle or an atom interacts with long wavelength modes. 
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[15] Stochastic picture of the dynamics 

There are various " levels" in which the dynamics of a non- isolated system can be treated. We start with the random 
walk problem that can describe the motion of a Brownian particle in the absence of friction. The we discuss the 
Langcvin equation where friction is included. The dynamics in the above problem is described by a diffusion equation 
and Fokkcr-Planck equation respectively. More generally we can talk about Master equations and in particular their 
simplest stochastic version which is known as rate equations. 

= [15.1] Random walk and diffusion 



Consider a particle that can hope from site to site in a stochastic manner. Each step can be represented by a random 
number f t = ±a, where a is the lattice constant and t is the integer time index. The total displacement is 

t 
x(t)-x(0) = £/(*') (668) 

t'=0 

Assuming a stationary stochastic process in which the correlation function is 

(/(*i)/(* 2 )> = C(h-t 2 ) (669) 

we get that the variance is 

t t t +t' t 

Var[x] = ]T]T(/(ii)/(i 2 )) = ]T £ C(r) = £ D(t') -► 2Dt (670) 

i 1= 0i 2 =0 f=0T=-t' t'=0 

where the asymptotic value of the diffusion coefficient is 

_. oo 

D = 2 S C ^ ( 671 ) 

r — — oo 

Most significant is to realize that there is a continuum limit of the random walk problem where the dynamics is 
described by the following " Langcvin" equation of motion 

x = f(t) (672) 

and accordingly 

D = \ f°° C(r)dr - \c{w=Q) (673) 



There are various generalizations of the random walk problem, where the dwell time or the size of the steps are 
random variables, leading in general to sub-diffusive or super diffusive behavior respectively. The latter case is known 
as Levi- flight. 

In the random walk problem the stochastic dynamics can be described by an equation for the time evolution of the 
probabilities p n to find the particle in site n. This has the form of a "rate equation". In the continuum limit it 
becomes the "diffusion equation" for the probability density p{x) which we describe below. More generally this type 
of "master equation" is known as the "Fokkcr Planck equation" which we discuss later on. 
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= [15.2] The diffusion equation 

It is natural to act what is the " master equation" that describes the time evolution of the probability density p t (x) in 
the case of a diffusion process. We assume that the stochastic equation of motion is x = f(t) with stochastic f(t) that 
has a zero average. A trivial generalization is to include a drift term such that the equation is x = u + f(t), where 
u is the soc-called drift velocity. In order to derive the diffusion equation, note that for any particular realization of 
f(t) the probability pt+dt( x t+dt)dxt+dt must equal pt(x t )dx t . Since the phase space element preserves its volume one 
obtains the Liouville equation (d/dt)p t (xt) = 0, from which one deduces the continuity equation 

|p t (x) = _(„ + /(t))^ (674) 

From this equation it follows that pt a +dt can be expressed as an integral that involves pt> within to < if < to + dt. 
The equation can be solved iteratively. In order to simplify notations we set without loss of generality to = and 
t = to + dt. Consequently we get an expansion that involves nested terms with higher order d/dx derivatives of po- 
For sake of clarity we drop the drift term and write 

Pt = Po - i dt'fit') 9 ^ + / dt'f(t')[ dt"f(t") 3 ^l + higher order terms (675) 

Jo &x Jo Jo ux 

Averaging over realizations of /(), and neglecting the 0(dt 3 ) terms, one obtains a diffusion equation, to which we add 
back the drift term: 

d dp „d 2 p /„„„x 

m p = ~ u ix +D J (676) 

The neglect of the 0(dt 3 ) terms is justified in the limit where the correlation time goes to zero. This is sometimes 
known as the Markovian approximation. It is possible to regard the diffusion equation as a combination of a continuity 
equation 

m*w = -i 1 ^ (677) 

with expression to the current that includes a drift term and a diffusion term. The diffusion term is known as Fick's 
law. The drift velocity is typically related to a the gradient of an external potential, u = —pV'(x), with a coefficient 
which is called mobility. Accordingly we write 

d dV dp 

I(x) = up t (x) - D— p t (x) = -W-q^- d -£ (678) 

If this expression is applied to a system in canonical equilibrium with p{x) ex exp(— /3V(x)), it follows from the 
requirement I(x) = that p = (1/T)D. This is called Einstein relation. It is useful in semiconductor physics. 
For electrons in metal it is common to define the conductivity a = pp, and postulate that at equilibrium 
p(x) = J dEg(E — V(x))f(E — Ep). It follows that the Einstein relation for metals is a = g{Ep)D. Note that 
g(Ep) is defined here as the density of one-particle states per unit volume, and it is proportional to pjEp. 

Fick's law can be explained heuristically as reflecting non-zero net current due to a difference in currents at points 
x ± £, where £ is the mean free path. Ignoring the drift, if we have a sample of length L with a steady state current 
then 



L 



p(L) - p(0) (679) 



This means that there is a strict analogy here to Ohm law, implying that D is formally like the conductivity of the 
chain, and accordingly can be obtained from a resistor network calculation. This observation is useful in analyzing 
diffusion is non-homogeneous networks. 
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[15.3] The Langevin equation 



Consider a test particle subject to a homogeneous but fluctuating field of force J 7 , leading to stochastic dynamics that 
is described by the Langeving equation mx = T . It is convenient to isolate the average (= "friction") term from J 7 , 
and accordingly to redefine J 7 as a stochastic variable (= "noise") that has zero average. Consequently the Langevin 
equation is written as 

mx = -r]x + T(t) (680) 

where J 7 is a stochastic variable that satisfies 

(T(t)) = (681) 

(T(t 2 )T(h)) = C(t 2 -h) (682) 

It is assumed that C{t) has a short correlation time. We are interested in the dynamics over larger time scales (we 
have no interest to resolve the dynamics over very short times). We also note that if J 7 were a constant force, then 
the particle would drift with velocity {l/rf)J-. The coefficient [i = 1/rj is called mobility. The equation for the velocity 
v = x can be written as 

4e ( " /m)4 «(t) = -c("/ m )*J-(i) (683) 

at m 

leading to the solution 

i rt 
v(t) = - dt'e- {T > /m ^ t - t ^(f) (684) 

''' J —CO 

We see that T n = m/rj is the damping time. After time 3> t v the initial velocity is forgotten, hence the lower limit 
of the integration can be extended to — oo. Evidently the average velocity is zero. We turn now to calculate the 
velocity- velocity correlation. "Squaring" and averaging over realizations we get 

(v{h)v(h)) = \ f 1 [' dt'dt"e-^ /m){tl+t2 - t '- t " ) C(t' - t") (685) 

We treat C(t' — t") like a delta function. Then it is not difficult to find that 



W'M^ - 2^ C " ( " /m)|ta " tll £ ) C(T)dT = m(i) 



-|*2-*i|/t-»j 



(686) 



The velocity velocity correlation function should coincide (for t\ = t 2 = t) with the canonical result (^mv 2 ) = |T. 
From this one deduces an FD relation v/(2rj) = T with regard to the response characteristics of the bath. The 
displacement x(t) — x(0) of the particle is the integral over its velocity v(t'). On the average it is zero, but the second 
moment is 



v 



((x(t) - x{0)) 2 ) = / / dt'dt"{v{t")v{t')) = -j x t = 2Dt (687) 

^o Jo V 

Hence we have diffusion in space. From the above we deduce the Einstein relation 

— = — = Temperature (688) 

The last equality is based on FD relation. 
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[15.4] The Fokker- Planck equation 



As in the case of a "random walk" one can ask what is the "master equation" that described the evolution of the 
probability density p(x,p). This leads to the Fokkcr-Planck equation. The derivation is the same as in the case of a 
diffusion process. Here the diffusion is in momentum with a coefficient v/2. Including the v(p) = p/m drift in the 
position, we get the continuity equation 



d_ _ _8_ 
dt dx 



vp 



d_ 

dp 



V'{x)p-r]vp- l ^ 1 P\ (689) 



2 dp 
There are quantum generalizations of the Fokkcr-Planck equation which wc discuss in a separate section. 

= [15.5] Rate equations 

A rate equation is merely a discrete version of the diffusion or Fokkcr-Planck equation. It can be regarded as describing 
a generalized "random walk" problem, where the transition rates W nm arc not necessarily equal in the n <— > m and 
m h- > n directions. In the context of the "system-bath" paradigm it is common to model the system as a set of levels 
{E n } with transition rates 

W ™=<m+ 1 + c(En ™ m)/TB (690) 

where w e nm and w^ m are the elements of symmetric matrices. The first term describes the transitions that are induced 
by the Ta=oo driving source, as in the standard random walk problem. The second term describes the bath induced 
transitions, with ratio e( E n- E m)/TB f n <^> m transitions, as required by detailed balance considerations. 



id a 



One can regard w^ m as the " noise" which is introduced into the system by the bath, while the difference W nm — W-, 
is the friction. However this point of view is strictly correct only for constant density of states. If the level density 
grows with energy there will be a heating effect even if the Tg=oo. 

The dynamics of the population probabilities p = {p n } is described by a rate equation 
dp. n 



dl 



= J2 i W nmPm - WmnPn] (691) 



This equation can be written schematically as p = Wp. There is an implicit conservation of probability requirement 
J2p n = 1- The diagonal elements of W arc — T„, with the decay rates T n = J2 m W mn - The steady state is determined 
from the matrix equation Wp = 0. In the presence of driving the detailed balance is disturbed leading in general to 
a non-canonical steady state. 

Diffusion.— It is useful to note that a rate equation that describes motion in ID, with transition rates that depends 
on the hopping distance, leads to diffusion with a coefficient 

D = - Y^ r 2 w(r) (692) 

r— — oo 

In particular for near neighbor hoping D — w. Here the distance between the sites is assumed to be unity. If we 
discretize a diffusion equation by slicing the x axis into cells of width dx, the effective hopping rate w should be chosen 
such that D = wdx 2 . 

Two level system.— There is an elegant way to write the general rate equation for a two level system. Defining 
S z — p+ — p-, and recalling that p + + p- = 1, one realizes that the two equations for p + and p_ can be written as a 
single equation (whose full quantum version will be discuss in the next lecture): 

S z = (W+- - W_ + ) - (w + _ + W- + )S Z = -—{S Z - Seq) (693) 
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[16] Quantum master equations 
= [16.1] General perspective 



The description of the reduced dynamics of a system that is coupled to a bath using a Master equation is commonly 
based on the following working hypothesis: (i) The bath is fully characterized by a single spectral function, (ii) There 
is a way to justify the neglect of memory effects. The latter is known as the Markovian approximation. In particular it 
follows that the initial preparation, whether it is factorized or not, is not an issue. If the master equation is regarded 
as exact description of the reduced dynamics it should be of the Lindblad form. Otherwise is should be regarded 
merely as an approximation. 

There are two common approximation schemes: (A) In the Microscopic regime of atomic physics (e.g. two level atom) 
it is assumed that the bath induced rates are much smaller than the level spacing, and a "secular approximation" is 
employed. (B) In the Mesoscopic regime of condense matter physics (e.g. Brownian motion) it is assumed that the 
bath is Ohmic, and accordingly its effect can be treated as a generalization of "white noise". 

In the following presentation we assume that the full Hamiltonian is 

Wtot.1 = n-WF bMh +n bM (694) 

Neglecting the interaction, the bath is characterized by the spectral function 

C(u) = FT[(F(i)F(0))l (695) 

and the convention (F(t)) = 0. Whether the bath is composed of harmonic oscillators or not is regarded by the 
working hypothesis as not important. There is a well known discussion of this point in Feynman- Vernon paper. 

The spectral function C{u>) is characterized by temperature T and by a cutoff frequency ui c . The latter is assumed 
below to be large compared with any other temporal frequency. What we call "noise" means C(—u>) = C(ui). What 
we call "finite temperature" means 

C(-lu)/C{lu) = exp(-w/T) (696) 

What we call "white noise" or "infinite temperature Ohmic bath" corresponds to C(w) = v, leading to 

C(t) = (F(t)F(0)) = t/6(t) (697) 

What we call "high temperature Ohmic bath" takes into account that C(u>) possesses an antisymmetric component 
that at small frequencies can be approximated as v x [cu/(2T)]. Consequently 

C(t) = (F(t)F(0)) = t/6(t)+iri6'{t) (698) 

where r\ = i//(2T) is the so called friction coefficient. 

= [16.2] Quantum Fokker- Planck equation 

The best known of reduced dynamics is known in the classical context as the Langevin equation. Using units such 
that the mass is unity, and p is the velocity, the stochastic equation is p = —V'(x) — rjp + /(£), where /(£) is white 
noise. The corresponding mater equation for p(x,p) is known as the Fokkcr-Planck equation: 



dp d 

dt dx 



Pl> 



d_ 
dp 



V'(x)p-m>p-^] (699) 
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This equation can be written with Poisson Brackets — i[A, B] as 

f = -i[H,p] ^[x,[x,p]]-il[x,{p,p}] (700) 

The same procedure to eliminate F(t) can be used in the quantum case (see section below), and the same equation 
is obtained with x replaced by X — W and p replaced by P = i[H, W) 

f = -W,p] ^{X,[X,p}]-q[X,{P,p}} (701) 

One should notice that this equation is not Lindblad form (see section below). The following term is missing: 

2 

missing term for Lindblad = - — [P, [P, p\] = — [P, [P, pi] (702) 

8^ 16T 

The three bath terms of the Fokkcr-Planck equation can be regarded as an expansion in powers of (Q/T), where 
f2 is the frequency of the motion. Accordingly in the high temperature regime the deviation of Fokkcr-Planck from 
Lindblad form is small. 

= [16.3] Derivation for white noise 

We demonstrates the derivation of the Master equation in that case of white noise. The Hamiltonian is 
H(t) = H + f(t)W, were /(£) represents white noise: that means that upon ensemble average (f(t)) = 0, while 
(f(t)f(t')) = v8(t — £'). Given p(t) = p, the Liouville von-Neumann equation can be solved iteratively to determine 
p(t + dt) , where dt is a small time interval: 

rt+dt rt+dt nt' 

p(t + dt) = p-i dt , [H(t , ), P }- / dt'dt"[H(t'),[H{t"), P }} + ... (703) 



Averaging over realizations of /(£) all the odd orders in this expansion vanish, while the leading dt contribution comes 
only from the zero order term that involves % and from the second order term that involves W . Consequently we get 
the following Master equation: 

^ = -i[n,p]- l -y[W,[W,p}} = -i[U,p]- l -{T,p} + vWpW (704) 

where V = vWW . Note that the first two terms in the second expression generate so called non-Hermitian dynamics 
with the effective Hamiltonian H o!f = % — (i/2)T, while the last term represents "continuous measurement". 

= [16.4] Derivation for Ohmic bath 

The generalization of the "white noise" derivation for a system that is coupled to a high temperature Ohmic bath is 
straightforward. It is based on the assumption that at any moment the system-bath state is " factorized" , which can 
be justifies if ui~ l is sufficiently small. Using the same procedure as for "white noise" one obtains: 

-j- = -i[H,p\ + W + pW + W P W- -WW + p-pW~W (705) 

where 

/•oo 

W ± = C(±t)W(-t)dt, W(t) = e mt We- im (706) 

Jo 
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For high temperature Ohmic bath we define W = X and P = i\H, X] and hence 



W ± = ^X±i^P (707) 

Consequently we get the Fokker-Planck equation 

f = -i[H,p] ^[X,[X,p]]-i^[X,{P,p}] (708) 



= [16.5] Comparison with Lindblad form 

A term in the Master equation is of Lindblad form if it can be written as: 

Lindblad = WpW ] - -W^Wp - -pW ] W (709) 



Writing 

W = A + iB (710) 

C = i[A,B] (711) 

D = (1/2){A,B} (712) 

and using the identity 

[A,{B,p}\ = l -[D,p] - l -{C,p}+A P B - BpA (713) 

we get the following optional expressions for the Lindblad term: 

Lindblad = -i[D,p] - -{A 2 + B 2 ,p} + ApA + BpB-i[A,{B, p}} (714) 

= -i[D,p}- 1 -[A,[A,p)]-^[B,[B,p)]-i[A,{B,p}} (715) 

The first term represents "Lamb shift", the second and the third are "noise" induced diffusion terms, and the last is 
the "friction" term. Note that the Fokker-Planck equation would be of this form if a positional diffusion term were 
added. 

^^^^= [16.6] The general Lindblad form 

A master equation is of Lindblad form if it can be written as 

f t = -i[H,p] +J2[W r ]p[W^ -i[I> + P r], T = J2lW r }HW r ] (716) 

r r 

Lindblad equation is the most general form of a Markovian master Equation for the probability matrix. The heart of 
the argument is as follows: The most general linear relation between matrices is 

p af 3 = ^W(a/V/3'W' (717) 

a'p' 



96 

This linear transformation is required to preserve the hermiticity of p. Changing notation to W(a/3\a' ft) = W aa ' ,pp> 
one observes that W aa ',i3i3' should be hermitian, with eigenvalues w r . Accordingly we can find a spectral decomposition 
with a transformation matrix T(aa'\r). Changing notation to W£ a , ~ y /vx r T(aa / \r) we get the Lindblad from 

X^I^I/ 9 !^ 7 "]^- Now we have to distinguish between two cases: If p is the new probability matrix, then conservation 
of trace requires ^^W^^W 1 "] = -*-• ^ on ^ ne other hand we look on the incremental change p, then (after some 
non-trivial algebra) we arrive to the Lindblad from of the Master equation. There is a complete analogy here with 
the Pauli equation where the decay rates in the main diagonal of W have to counter-balance the ingoing transitions. 

= [16.7] The secular approximation 



We start again, as in the Mesoscopic case, with 

-j- = -i[H,p\ + W + pW + WpW- -WW + p- pW~W (718) 

We can write W(t) as ~J2 nm \n)W nm e l ( En ~ Em ^ t {'m\ or schematically as ^2^G~ int A^. We obtain 

W ± = ^2C ± {Q)A CI , Sl = -(E n -E m ), A a = \n)W nm (m\ (719) 

Q 

The secular approximation means that we keep only "resonant terms". For example in W + pW we keep only the 
terms C + {Vt) A^pA^. Consequently 

f t = - i {H,p}+Y,[(C + (n) + C-(n))AnpAl 2 -C + (n)Al 2 A n p-C-(n)pAUn} (720) 

n 

This approximation will be better explained in the next section. Note that the real part of ^ ± C ± (fl) is the spectral 
intensity C(Q) that drives the FGR transitions: 

w nm — (FGR transition rate from mton) = C(—(E n —E m )) x iWnm] 2 (721) 

The imaginary part of C (£Y) determines the Lamb shifts. 

= [16.8] The Pauli Master equation 

In atomic physics applications the induced rate of transitions w becomes much smaller compared with the Rabi- 
Bloch frequency £1 of the coherent oscillations. Then we can ignore all the highly oscillating terms that involve the 
off-diagonal terms, because they average to zero. Consequently we get the so-called Pauli master equation for the 
probabilities p n 

a i^ 1 Wl2 

-£ = Wp, W=\ W2i -r 2 ... j (V22) 



In this equation the W matrix is in agreement with the FGR picture. Namely, the rate constants are w nm — v\W nm \ 2 , 
and the decay constants are r„ = ^ m w nm . It is convenient to exclude here the diagonal elements of W, and to treat 
them later separately. 

If we have an interaction with a bath, then the ratio w nm /w mn is not unity, but would favor downwards transitions. 
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As for the off-diagonal terms: with the same reasoning we regard the p nm as independently oscillating with different 
frequencies, i.e. with negligible cross interaction. Consequently we get 



U>Pnn 

dt 



i(E n - E m ) - 7„ m p nm , 7„ m = -(r„ + r m ) + -\W nn - W mm \ 2 (723) 



[16.9] The Bloch equation (NMR) 



Taking a two level system as an example, and adopting the NMR spin language, the Pauli equation takes the form of 
a Bloch equation for the so-called polarization or Bloch vector (S x , S y ,S z ). Note that S z = p+ — p- is the population 
probability difference. The Bloch equations are: 

f = -£<*-*J (724 > 



r 2 



, — [" X S\x,y rp ^>x,y ('25) 



where S ocl = (w^ — w |_)/(u^ + w ^). The rates for the diagonal relaxation and for the off-diagonal transverse 

depahsing are: 

— = w+- + w-+ = 2w (726) 

Y = 2( w +-+ w -+)+"f V = 7 (727) 

The pure dephasing rate j v originated from the diagonal elements of W nm and hence is formally proportional to 
(7(0), while the FGR transition rates originate from the off-diagonal elements of W nm , and hence are proportional to 
C(±£Y), were ft = \E + — E_\ is the level spacing. Note that the transverse components satisfy the equation 

S + jS + n 2 S = (728) 

which leads to damped frequency O eff = \/tt 2 — (j/2) 2 . 
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Additional topics 
[17] The kinetic picture 

= [17.1] The Boltzmann distribution function 



The number of one particle states within a phase space volume is dAf = d 3 rd 3 p/(2irti) 3 . The occupation of this phase 
space volume is: 



drdp 

' {2irhY 



dN = f(r,p)^- 3 (729) 



where f(r,p) is called Boltzmann distribution function. In equilibrium we have 

f(r,p) = f(e p -n) (730) 

cq 

where /(e — fi) is cither the Bosc or the Fermi occupation function, or possibly their Boltzmann approximation. If we 
use (r, v) with measure d 3 rd 3 v instead of (r,p) we have 

f(r,v) = (£) 3 /(r,p) (731) 

By integrating over r and over all directions we get the velocity distribution 

F(v) = L 3 x W (£)' / Qmt, 2 - ^ (732) 

If we use Boltzmann approximation for the occupation function and express \x using N and T we get 

F{v) = N (£f) V2 ^z-\^[T (733) 

We note that 

N = //(2W /(r ' P) = faWk-ri = J F{v)dV (734) 



= [17.2] The Boltzmann equation 

The Liouville equation for p(r,p) is merely a continuity equation in phase space. It can be written as (d/dt)p = 
where d/dt unlike d/dt is the total derivative reflecting the change in the occupation of a phase space cell. The 
Boltzmann equation for f(r,p) is formally identical to the Liouville equation in the absence of collisions, and with 
collisions becomes (d/dt)p = g(r,p), where g(r,p) is the net rate in which particles are generated at (r,p) due to 
collisions. Accordingly the Boltzmann equation is 

O rj rj 

di + V{P) ' dx + • F(r) ' dpi f(r ' p) = ff(r ' p) (735) 

where v(p) = p/m is the dispersion relation, and J-(r) = —V'(r) is the force due to some external potential. The 
common expression for g{r,p) is based on 2body collision mechanism and "molecular chaos" assumption. The gas 
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reaches a steady sate in accordance with the Boltzmann H theorem. The formal solution for the steady state implies 
the Maxwcll-Boltzmann distribution for the velocities. 



= [17.3] The calculation of incident flux 

Given N gas particles that all have velocity v we can calculate the number of particles that hit a wall element per unit 
time (=flux), and also we can calculate the momentum transfer per unit time (=force). Using spherical coordinates, 
such that normal incidence is 0=0, one obtains (per unit area): 



J = 



P = 



\S\<tt/2 



|0|<tt/2 



dfLN 

dflN 
4tt V 



v cos(8) = 
vcos(8) 2mv 



1 
2 Jo 



cos(9)dcos(9) 



1 (N 



N 

\/ V = 4lV ,r 



cos 2 (6»)dcos(6») 



N 
V 



1 (N 



mo 



3 \V 



mv 



(736) 

(737) 



If we have the distribution F(v) of the velocities, or optionally if we are provided with the one-particle energy 
distribution, the total flux is given by an integral: 



<Ancidcn 



1 (F(v)dv 



o n V ) 7 4 V V 



1 fg(e)f(e)de 



(738) 



Similar expression holds for the pressure P, where one can make the identification e = (l/2)mv 2 , and recover the 
familiar Grand canonical result. 



[17.4] Blackbody radiation 



The modes of the electromagnetic field are labeled by the wavenumber k and the polarization a. For historical reasons 
we use k instead of p for the momentum and uj instead of e for the energy. The dispersion relation is linear u> = c\k\. 
The density of modes is 



*(oj)duj = 2 X 



V 



(2ttc) 



t4ttlo dcu 



(739) 



Recall that the canonical state of oscillators can be formally regarded as the grand canonical equilibrium of fi — 
Bose particles, with the occupation function 



1 



[Ilka) = 



g0UJka — \ 

For the total energy E we have 



= /(Wfe a ) 



(740) 



E = 



,oo 1 

uidui g(u>) /(w) = V / dui- 

I! Jo 



7T 2 C 3 V C 9 " - 1 



(741) 



For the total number of photons N we have a similar integral but without the uj. The flux of photons is obtained 
from the formula J = (1/4)(7V/V)c. From historical perspective we are interested in the flux of energy: 



incident [energy] = - ( — | <■ 



did 



1 



47T 2 ( 



2/3w _ ^ 



(742) 



Considering the thermal equilibrium between blackbody radiation from the environment, and the an object with a 
given temperature, detailed balance consideration implies that 



J,, 



(uj)duj = a(w)J incidont (uj)doj 



(743) 



100 



where the emissivity a(w) should equal the absorption coefficient of the incident radiation. From here we get the 
Planck formula 



J« 



iM 



a(uj) 



1 



47T 2 C 



2, .2 



2 I3lu _ I 



a(ui) 

47T 2 C 2 



T- 5 



e" - 1 



(744) 



where v = lu/T is the scaled frequency. Note that the peak of a blackbody radiation is at v w 3 which is known as 
Wcin's law. Upon integration the total blackbody radiation is 



J,, 



•/emitted (w)dW 



1 

47T 2 C 



14 



T 



(745) 



which is know as Stcphan-Boltzmann Law. Note that the units of the flux are energy per time per unit area. 




[17.5] Viscosity 



We have considered above the rate in which momentum is transferred to a wall due to ballistic collisions, leading 
to pressure. There is a somewhat related effect that is called "viscosity". It is simplest to explain the concept with 
regard to a gas whose particles have a short mean free path i, such that in equilibrium each gas particle has a diffusion 
coefficient D = vtI, where Vt is the mean thermal velocity. 

Assume out of equilibrium steady state in which the average velocity of the gas particles v(x, y, z) is in the x direction 
but its magnitude varies in the y direction. Due to the transverse diffusion there will be momentum transfer across 
the y = plane, which implies that the "upper" flow exerts a force on the "lower" region of the fluid (which is possibly 
the boundary layer of some "wall"). We shall explain below that if the area of the boundary region is A, then the 
force on it is given by the "stress-shear equation" 



F x = pA 



dv x 
dy 



(746) 



where p = pD is the viscosity, and p is the mass density of the gas. The argument goes as follows: Divide the y axis 
into layers of width dy. Define w, the effective transition rate of particles between layers, such that D = wdy 2 . Define 
the flow of momentum between layers as J{y) — [{pdy)v x {y)\ x w. Hence J(dy/2) — J{—dy/2) is the rate in which 
momentum is transferred across y = 0, leading to the desired result. 

The viscosity plays a major role in the NavierStokes equations. These equations describe the rate of change of the 
velocity field of as fluid due to momentum transfer. The equation included "pressure" term "viscosity" term and an 
optional external "force" term (say gravitation). Usually the equation is supplemented by a continuity equations for 
the mass, and for the energy, as well as by state equation the connects the pressure to the density. A well known 
result is Stokes law for the friction force that is exerted on a spherical object of radius R 



-6irpR v Bp 



(747) 



Note the the result depends on the radius and not on the area of the sphere. It holds for incompressible flow with 
very small Reynolds number. The simplest way to derive it, is to find the velocity field for a potential flow, as if there 
is no viscosity, but with no-slip boundary conditions, and then to use the "stress-shear equation". 
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[18] Scattering approach to mesoscopic transport 

The most popular approach to transport in mesoscopic devices takes the scattering formalism rather than the Kubo 
formalism as a starting point, leading to the Landauer and the BPT formulas. We first cite these formulas and then 
summarize their common derivation. This should be compared with the Kubo-based derivation of the previous section. 



[18.1] The Buttiker-Pretre-Thomas-Landauer formula 



We assume without loss of generality that there are three parameters (x\, X2,x$) over which we have external control, 
where x% — $ is the AB flux. The expression for the current I A that goes out of lead A, assuming DC linear response, 
can be written as 

I A = -J2G 3l x, (748) 

i 

where —±3 = — $ is the EMF, and therefore G 33 is the conductance in the usual sense. The Biittiker-Pretre-Thomas- 
Landaucr formula for the generalized conductance matrix is 

G^ = -^trace (p a ^-A (749) 

2m \ oxj J 

In particular for the Ohmic conductance we get the Landauer formula: 

p 2 
G 33 = — traced) (750) 

2nn 

In order to explain the notations in the above formulas we consider a two lead system. The S matrix in block form 
is written as follows: 

H.>'t~'0 <75i) 

where r and t are the so called reflection and transmission (sub) matrices respectively. We use the notation <j) ~ e$/fi. 
In the same representation, we define the left lead and the right lead projectors: 

The following identity is important in order to obtain the Landauer formula from the BPT formula: 

% - iUp a SP b -iP b SP a ) = iUP A S-SP A ) = -iUp b S-SP b ) (753) 

Another important identity is 

trace(P A 5P B 5 t ) = trace(tt t ) = ^2^2\t ab \ 2 (754) 

aGAbeB 

The traceQ operataion is over the channel indexes. 
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= [18.2] Floque theory for periodically driven systems 

The solution of the Schrodinger equation 

i^- = Hi}j (755) 

with time independent % is 

|^)>=]Tc-«|^)) (756) 

£ 

where the stationary states are found from 

U\4> {£) ) = £\4> {£) ) (757) 

Consider now the more complicated case where % depends periodically on time. Given that the basic frequency is oj 
we can write 

n(t) = j2 n(n)e ~ mu;t ( 758 ) 

The solution of the Schrodinger equation is 

oo 

|V(t))=2 Yl e- i{£+nw)t \il> {£ < n) ) (759) 

£ n— — oo 

where the Flouqe states are found from 

^^ (n -"' ) |V> (£ '™' ) ) = {£ + nu)\i/>V' n) ) (760) 

n' 

and £ is defined modulo uj. 



[18.3] The Floque scattering matrix 



In scattering theory we can define a Flouqe energy shell £. The solution outside of the scattering region is written as 



m))= E ^ i(£+Hi E 



n— rtfi 00 



A ,-ik an r _ R r +ik an r 

Jrvn.m. , c ±Jft.n, , C 



*-an j c J - J an 

' v an 



I "-- — ' tin , — 

'Van y^a 



|o) (761) 



where w an and k an are determined by the available energy £ + nco. The current in a given channel is time dependent, 
but its DC component is simply ^2 n (\B an \ 2 — |j4 a „| 2 ). Therefore the continuity of the probability flow implies that 
we can define an on-shell scattering matrix 

-C*bri b = y ^ Jbn b ,an a ^an a (762) 

an a 

We can write this matrix using the following notation 

S bnb , ana = SZ- na (£ + n a co) (763) 
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Unitarity implies that 

X>^.°« J 2 = ]T|S 6 "a(£)| 2 = l (764) 

fcnt bn 

Y.\ S *n b ,anf = ^ | S£ a (£ + n W ) | 2 = 1 ( 765 ) 

an a an 

If the driving is very slow we can nse the adiabatic relation between the incoming and outgoing amplitudes 

B b (t) = J2 s ba(X(t)) A a (t) (766) 

a 

where Sb a (X) is the conventional on-shell scattering matrix of the time independent problem. This relation implies 
that 

SUE) = ^[^ S ba (X(t)) e^dt (767) 

For sake of later use we note the following identity 

I>i s mI 2 = LS^ dtSbai §r 1Sba{x{t)) (768) 



= [18.4] Current within a channel 

Consider a one dimensional channel labeled as a. Let us take a segment of length L. For simplicity assume periodic 
boundary condition (ring geometry). If one state n is occupied the current is 

e 

Ia,n = jV a .n (769) 

If several states are occupied we should integrate over the energy 

I a = £/„(£„)!„,„ = [fa(E)^(h a ) = £- I J a {E)dE (770) 

n •" 1TVa n J 

For fully occupied states withing some energy range we get 

2 

la = ^(E 2 -E 1 ) = |-(V2-Vi) (771) 

If we have a multi channel lead, then we have to multiply by the number of open channels. 

= [18.5] The Landauer formula 

Consider a multi channel system which. All the channels are connected to a scattering region which is described by 
an S matrix. We use the notation 

g ba = \S ba \ 2 (772) 
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Assuming that we occupy a set of scattering states, such that f a (E) is the occupation of those scattering states that 
incident in the ath channel, we get that the outgoing current at channel b is 



I b =^ldE 



(y^gbafaiE)) -f b (E) 



(773) 



Inserting 1 = J^ a g^ a in the second term we get 



h= Y JdE 



Y,9baila(E)-f b {E)) 



(774) 



Assuming low temperature occupation with 



f a (E) = f(E - eV a ) « f(E) - f(E)eV a 



(775) 



we get 



h=-^J29ba(V b -V a ) 



(776) 



which is the multi channel version of the Landaucr formula. If we have two leads A and B we can write 



/„ = -: 



2tt 



,6eB a£A 



(V B - V A ) 



(777) 



Form here it follows that the conductance is 



2 2 

G =^T,T,9ba^ ^trace(P B SP A St) 



beB aeA 



(778) 



where P A and P B arc the projectors that define the two leads. 



[18.6] The BPT formula 



Assuming that the scattering region is periodically driven we can use the Floque scattering formalism. The derivation 
of the expression for the DC component lb of the current in channel b is very similar to the Landauer case, leading to 



h = 



2tt 



2-/r 



\ J dE (E \ S ba( E - ™)\ 2 fa(E + nw) j - fb(E) 
]T \S£ a (E - mj)\ 2 (f a (E - nu) - f b (E)) 

a,n 

J2\SL(E)\ 2 (fa(E)-fb(E + nu J )) 

a . n 

Y^n^SUE^t-f'AE)) 



dE 



dE 



e 

2^ 



Y,MSl\{E)f 



(779) 

(780) 
(781) 
(782) 
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In the last two steps we have assumed very small lu and zero temperature Fermi occupation. Next we use an identity 
that has been mentioned previously in order to get an expression that involves the time independent scattering matrix: 

A-^Esr**^*-™' <783) 

which implies that the pumped charge per cycle is 

Q = i^jdxY J Y. Si j^ s ^ x )^-j G ^ x ) dx ( 784 ) 



beB a 



G W = -^HY, %£W) = "^trace (pM#) (785) 



with 



beB a 

Note: since S(X) is unitary it follows that the following generator is Hcrmitian 

H(X)=i — S^ (786) 

The trace of a product of two hcrmitian operators is always a real quantity. 



[18.7] BPT and the Friedel sum rule 



If only one lead is involved the BPT formula becomes 

dN = -i— trace ( ^S+ ] dX (787) 

2tt \dX J K ' 

where dN is the number of particles that are absorbed (rather than emitted) by the scattering region due to change 
dX in some control parameter. A similar formula known as the Friedel sum rule states that 

dN=-i — tracef SUm dE 

2tt \dE 

where N(E) counts the number of states inside the scattering region up to energy E. Both formulas have a very 
simple derivation, since they merely involve counting of states. For the purpose of this derivation we close the lead 
at r — with Dirichlct boundary conditions. The eigen-energies are found via the equation 

det(S(E,X)-l) = (789) 

Let us denote the eigenphases of S as 6 r . We have the identity 

i^2S0 r = 5[ln dct S] = traced In S] = trace[SSS^} (790) 

r 

Observing that a new eigenvalue is found each time that one of the eigenphases goes via 9 — we get the desired 
result. 
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[19] The theory of electrical conductance 
= [19-1] The Hall conductance 



The calculation of the Hall conductance is possibly the simplest non-trivial example for adiabatic non-dissipative 
response. The standard geometry is a 2D "hall bar" of dimension L x x L y . In "Lecture notes in quantum mechanics" 
we have considered what happens if the electrons are confined in the transverse direction by a potential V(y). Adopting 
the Landauer approach is is assumed that the edges are connected to leads that maintain a chemical potential 
difference. Consequently there is a net current in the x direction. From the "Landau level" picture it is clear that the 
Hall conductance G xy is quantized in units e'(2irh). The problem with this approach is that the more complicated 
case of disorder V(x, y) is difficult for handling. We therefore turn to describe a formal Kubo approach. Form now 
on we use units such that e = h = 1. 

We still consider a Hall bar L x x L y , but now we impose periodic boundary condition such that ip(L x , y) = e l< ^ x il>(0, y) 
and ip(x,L y ) = e lc ^ y tp(x,0). Accordingly the Hamiltonian depends on the parameters (<f) x ,(j> y ,&B), where $b is the 
uniform magnetic flux through the Hall bar in the z direction. The currents I x = (e/L x )v x and I y = (e/L y )v y 
are conjugate to <j) x and 4> y . We consider the linear response relation I y = —G yx (f) x . This relation can be written as 
dQ y = —G yx d<f) x . The Hall conductance quantization means that a 2tt variation of <p x leads to one particle transported 
in the y direction. The physical picture is very clear in the standard V(y) geometry: the net effect is to displace all 
the filled Landau level "one step" in the y direction. 

We now proceed with a formal analysis to show that the Hall conductance is quantized for general V{x, y) potential. 
We can define a "vector potential" A n on the (cf> x , (j> y ) manifold. If we performed an adiabatic cycle the Berry phase 
would be a line integral over A n . By Stokes theorem this can be converted into a d4> x d<t> y integral over B n . However 
there arc two complementary domains over which the surface integral can be done. Consistency requires that the 
result for the Berry phase would come out the same modulo 2ir. It follows that 

1 r 27T r 2l! 



B n d(j) x d(j>y = integer [Chern number] (791) 

27r Jo Jo 

This means that the <j> averaged B n is quantized in units of 1/(2-71"). If we fill several levels the Hall conductance is 
the sum ^ n B n over the occupied levels, namely 

Gyx ^ ^ (E m -E n y (792) 

neband m 

If we have a quasi-continuum it is allowed to average this expression over (cf) x ,(f> y ). Then we deduce that the Hall 
conductance of a filled band is quantized. The result is of physical relevance if non-adiabatic transitions over the gap 
can be neglected. 

===== [19.2] The Drude formula 

The traditional derivation of the Drude formula is based on the Boltzmann picture. Optionally one can adopt a 
Langcvinc-like picture. The effect of the scattering of an electron in a metal is to randomize its velocity. This 
randomization leads to a statistical "damping" of the average velocity with rate 1/tg. On the other hand the electric 
field accelerates the particle with rate e£/m. In steady state the drift velocity is Vdrift = {e£/m)te, and the current 
density is J = {N /\l)evdrift leading to the Drude conductivity <x = (N /V)(e 2 /m)tg. Consequently the conductance of 
a ring that has a length L and a cross-section A is 

A N fe 2 \ 2 ( N \ £ e 2 k J frmn . 

G = y ff = To\ — )te = e r It = w-^M- 793) 

L L 2 \m) \mv F LJ L 2?rfi L J 

where £ — vpti is the mean free path at the Fermi energy, and M. is the effective number of open modes. Below we 
would like to derive this result formally from the FD relation. 
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The canonical version of the FD relation takes the form G = [1/(2T)]vt, where v is the intensity of the current 
fluctuations and G is the conductance. This is known as Nyquist version of the FD relation. One way to go is to 
calculate vt for a many body electronic system, see how this is done in a previous lecture. But if the electrons are non- 
interacting it is possible to do a shortcut, relating the conductance of the non-interacting many body electronic system 
to its single particle fluctuations. This can be regarded as a generalizations of the canonical Nyquist formula. The 
generalization is obtained by re- interpretation of f(E) as the Fermi occupation function (with total occupation N), 
rather than probability distribution. Assuming a Boltzmann occupation one obtains G*- N ' — \N / (2T)\vt ■ A similar 
generalization holds for a microcanonical occupation, from which one can deduce results for other occupations. In 
particular for low temperature Fermi occupation of non-interacting particles one obtains: 

G [N] = \%{E F )v Ep = \g{E F )(^f j{v^t)v\^))dt = (ffg(E F )V (794) 

The crossover from the high temperature "Boltzmann" to the low temperature "Fermi" behavior happens at 
T ~ E F . Assuming exponential velocity- velocity correlation function with time constant To, such that the mean 
free path is I = v f tq, we get Do = v F £. disregarding numerical prefactors the density of states can be written as 
g(E F ) — (L/v F )M, where M. is the number of open modes. From here we get the Drude formula 

m = (|)Vrf% = |X ,795) 

Relating to electrons that are moving in a lattice with static disorder, the mean free path can be deduced from the 
Fermi Golden Rule (FGR) picture as follows: Assuming isotropic scattering, the velocity-velocity correlation function 
is proportional to the survival probability P(t) = c~*'* £ . Ignoring a factor that has to do with the dimensionality 
d = 2, 3 of the sample the relation is 

(w(t)v(O)) « vl P{t) = vle- w/te (796) 

where the FGR rate of the scattering is 

^ = 2iTQ E \U mn \ 2 = — W 2 (797) 

In the last equality we have used \U nm \ 2 « [a /(ML)} W 2 , where a is the lattice spacing, and W is the strength of the 
disorder. Disregarding prefactors of order unity we deduce the so-called Born approximation for the mean free path: 



v E U 



KW <™> 



== [19.3] Formal calculation of the conductance 

The DC conductance G of a ring with N non-interacting electrons is related by Kubo/FD expression to the density 
of one-particle states g(E F ) at the Fermi energy, and to the C vv (u> ~ 0) fluctuations of velocity. The latter can be 
deduced semi-classically from the velocity-velocity correlation function, or from the matrix elements of the velocity 
operator using the quantum-mechanical spectral decomposition. Optionally one can use path integral or Green 
function diagrammatic methods for the calculation. 

Let us summarize some optional ways in which the Kubo/FD expression for the Ohmic conductance can be written. 
If we use the spectral decomposition with p n = g(E F )~ 1 d(E n — E F ), we get 

G = \g{E F ) (^j 2 C vv {0) = nJ2 (I)' \vnm\ 2 5(E n - E F ) 5(E rn - E n ) (799) 



n in- 
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It is implicit that the delta functions are "broadened" due to the assumed non-adiabaticity of the driving, else we 
shall get vanishing dissipation. The calculation of G is the adiabatic regime requires a more careful treatment, and 
possibly goes beyond LRT. As long as the broadening is large compared to the level spacing, but small compared with 
other energy scales, the result is not too sensitive to the broadening parameter, and small corrections are commonly 
ignored, unless the ring is very small. Schematically the above expression can be written as 

G = Tthg{E F ) 2 Q) 2 k m | 2 = 7rhg(E F ) 2 \2 nm \i (800) 

where the bar indicates that an average should be taken over the near diagonal matrix elements of the velocity operator 
near the Fermi energy. A somewhat more fancy way to write the same is 



G = tt trace 



1S(E F -H)1 5(E F -H) = - trace [1 Im[G(E F )] 1 lm[G(E F )]] (801) 

7T 



where G = 1/(E — Ji + iO) is the one-particle retarded Green function. This opens the way to formal calculations that 
are based on path integral or diagrammatic methods. 



For a chaotic ring, the dispersion \T nm \ 2 of the off-diagonal matrix elements is equal, up to a symmetry factor, to the 
dispersion of the diagonal matrix elements. Note that I nn = —dE n /d&. It is common to use the notation <p = (e/fi)$. 
Hence one obtains the Thouless relation: 



GW = factor x %- x -L 

h A 2 



dE n 



(802) 



where the numerical factor depends on symmetry considerations, and A is the mean level spacing at the Fermi energy. 
There is a more refined relation by Kohn. The Thouless relation is a useful staring point for the formulation of the 
scaling theory for localization. 



[19.4] Conductivity and Conductance 



Consider a ring geometry, and assume that the current is driven by the flux $. In order to have a better defined 
model we should specify what is the vector potential A(r) along the ring. We can regard the values of A at different 
points in space as independent parameters (think of tight binding model). Their sum (meaning § A{r)-dr) should be 
$. So we have to know how $ is "distributed" along the ring. This is not just a matter of "gauge choice" because the 
electric field £ (r) = — A(r) is a gauge invariant quantity. So we have to know how the voltage is distributed along 
the ring. However, as we explain below, in linear response theory this information is not really required. Any voltage 
distribution that results in the same electro-motive force, will create the same current. 

In linear response theory the current is proportional to the rate in which the parameters are being changed in time. 
Regarding the values of A at different points in space as independent parameters, linear response theory postulates 
a linear relation between (J(r)) and £(r') that involves the conductivity matrix a{r,r') as a kernel. The current 
density has to satisfy the continuity equation V • (J(r)) — 0. From here it follows that if we replace A by A + VA(r), 
then after integration by parts we shall get the same current. This proves that within linear response theory the 
current should depend only on the electromotive force — <&, and not on the way in which the voltage is distributed. 
Note that A i-> A + VA(r) is not merely a gauge change: A gauge transformation of time dependent field requires a 
compensating replacement of the scalar potential (which is not the case here) . 

In the following it is convenient to think of a device which is composed of "quantum dot" with two long leads, and 
to assume that the two leads are connected together as to form a ring. We shall use the notation r — (r, s), where r 
is the coordinate along the ring, and s is a transverse coordinate. In particular we shall distinguish a left "section" 
r = r B and a right section r = r A of the two leads, and we shall assume that the dot region is described by a scattering 
matrix S a b- 

We further assume that all the voltage drop is concentrated across the section r = r B , and we measure the current I A 
through the section r = r A . With these assumptions we have two pairs of conjugate variables, which are ($ A ,X A ) and 
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($ b ,Zb). Note that the explicit expression for the current operator is simply 

Ik = e^(v 6{r - r A ) + <5(r - r» (803) 

where v is the r component of the velocity operator. We are interested in calculating the conductance, as define 
through the linear response relation (I A ) = — G AB <fr B . The Kubo expression takes the form 

g ab = ft trace j Za Im j G j Jb Im jqj . 804 s 

7T 

This is yet another version of the Kubo formula. Its advantage is that the calculation of the trace involves integration 
that is effectively restricted to two planes, whereas the standard version (previous section) requires a double integration 
over the whole " bulk" . 

= [19.5] From the Kubo formula to the Landauer formula 

Before we go on we recall that it is implicit that for finite system Im[G] should be "smeared". In the dot-leads setup 
which is described above, this smearing can be achieved by assuming very long leads, and then simply "cutting" them 
apart. The outcome of this procedure is that G ± is the Green function of an open system with outgoing wave (ingoing 
wave) boundary conditions. As customary we use a radial coordinate in order to specify locations along the lead, 
namely r = r a (r), with < r < oo. We also define the channel basis as 

(r,s\a,r) = Xo (») S(r - r«(r)) (805) 

The wavefunction in the lead regions can be expanded as follows: 

I*) = E ( C *-+ elkar + C a ,-e- iKr ) \a, r) (806) 

a,r 

We define projectors P + and P~ that project out of the lead wavefunction the outgoing and the ingoing parts 
respectfully. It follows that P+G+ = G+, and that P~G + = 0, and that G~P~ = etc. We define the operator 



r A = 2J k r A )nv a (a,r A \ (807) 

= 8(r-r A )<S> ^2 \ a ) hv a{a\ 



a£A 



where v a = (7jfc a /mass) is the velocity in channel a. The matrix elements of the second term in Eq.(808) are 

T A (s,s') = Y / Xa(s)hv aX :(s') (809) 

The operator T B is similarly defined for the other lead. Note that these operators commute with the projectors P . 
It is not difficult to realize that the current operators can be written as 

I A = (e/h)[-P+T A P+ + p-T A p-} (810) 

I B = {e/h)[+P + T B P+ - P-T B P-} (811) 

Upon substitution only two (equal) terms survive leading to the following version of Kubo formula: 



e 



2 



G BA = — trace [r B G+ T A G~] (812) 



2irh 
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There is a well known expression (Fisher-Lee) that relates the Green function between plane A and plane B to the S 
matrix. Namely: 



G + 0b,s a ) = -i^2xb(s B )—^=S ba —==x* a ( s A) 



a, . b 



hvh vhvn 



(813) 



Upon substitution we get 



~ 2irh 



G™ = ^ l^l^\S ba \ 



(814) 



aeAbeB 



This is the Landauer formula. Note that the sum gives the total transmission of all the open channels. 



[19.6] From the Kubo formula to the BPT formula 



It should be emphasized that the original derivations of the Landauer and the BPT formulas are based on a scattering 
formalism which strictly applies only in case of an open system (= system with leads which are connected to reservoirs). 
In contrast to that Kubo formula is derived for a closed system. However, it can be shown that by taking an appropriate 
limit it is possible to get the BPT formula from the Kubo formula. Namely, 



M _ 



trace [F k Im[G+] F j Im[G+]] 



47T 



-trace 



ast ds 

dxi dxj 



(815) 
(816) 



B 33 



ih 

~2^ 



trace [F 3 (G++G - ) F j Im[G+]] 



Airi 



: trace 



PA^- d Ss 



dxi 



dx-j 



intrf 



(817) 
(818) 



So the sum is 






(819) 



For more details see Phys. Rev. B 68, 201303(R) (2003). 
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[20] Irreversibility and Nonequilibrium processes 
= [20.1] The origin of irreversibility 



Assume an isolated system with Hamiltonian H(X), where X is a set of control parameters that determine the "fields" . 
For simplicity assume that at t = the system is in a stationary state. A driving process means that X = X(t) 
is changed in time. In particular a cycle means that A(< final ) = A(£=0). A driving process is called reversible is we 
can undo it. In the latter case the combined process (including the "undo") is a closed cycle, such that at the end 
of the cycle the system is back in its initial state. Generally speaking a driving cycle becomes reversible only in the 
adiabatic limit. Otherwise it is irreversible. 

Micro-reversibility.— One should not confuse reversibility with micro-reversibility The latter term implies that 
the mechanical evolution has time reversal symmetry (TRS). This TRS implies that if we could reverse that state of 
the system at some moment (and also the magnetic field if exists) then ideally the system would come back to its 
initial state. This is called Lodschmit Echo. In general it is impossible to reverse the sate of the system, and therefore 
in general micro-reversibility does not imply reversibility! 

Sudden process.— The irreversibility of typical systems is related to chaos. The simplest example is free expansion. 
In this example X is the location of a piston. At t — the system is prepared in an ergodic state, say a microcanonical 
state on the energy surface T-L(Xa) = E. The piston is moved outwards abruptly form X& to Xq. After some time 
of ergodization the system will become ergodic on H(Xb) = E. There is no way to reverse this process. 

Slow driving.— The more interesting scenario is a slow cycle. Using the assumption of chaos it can be argued that 
at the end of the cycle the state will occupy a shell around 'H(Xa) = E. If the system is driven periodically (many 
cycles), the thickness of this shell grows like y/Dsi with De oc X 2 . This diffusion in energy space implies (with some 
further argumentation) monotonic increase of the average energy. Thus irreversibility implies dissipation of energy: 
The system is heated up on the expense of the work which is being done by the driving source. 

Non equilibrium steady state.— Another reason for irreversibility is having a "frustrated" system that is connected 
to several baths, each in different temperature, as in the prototype problem of heat conduction. Typically, after a 
transient, a steady state is reached. But this steady state is not a canonical thermal equilibrium state. With such 
configuration one can associate a rate of "entropy production". 

= [20.2] The Boltzmann version of the second law 



There are two possible approaches to discuss irreversible processes. One approach, following Boltzmann, is based on 
the theory of ergodic motion, while the other is the thermodynamic formulation. The Boltzmann approach is based on 
a statistical definition of entropy. In order to avoid confusion we distinguish between Shanon entropy, Von-Neumann 
entropy, Boltzmann entropy and Thermodynamic entropy. All except the latter are defined by the same look alike 
formula S = — ^2 r p r log(p r ), but the context and the meaning of the p r is in general not the same. 

Information entropy:— If we have a probability distribution {p r } then S provides a measure for the uncertainty 
which is involved in our knowledge of the statistical state. This point of view that regards S as an information 
measure has been promoted by Shanon. In the quantum mechanical context we define "classical state" as implying 
100% certainty for any measurement. Such states do not exist in Nature. Rather the states of minimum uncertainty 
in N dimensional Hilbert space are the pure states, and they have finite information entropy. See http : //arxiv . 
org/abs/quant-ph/0401021. They should be contrasted with the worst mixed state whose entropy is S = log(iV). 

Von-Neumann entropy:— Von-Neumann has used a Shanon look-alike formula in order to characterize the purity 
of a quantum state. In the Von-Neumann definition, the p r are the weights of the pure states in the mixture. It is 
important to realize that the Von-Neumann entropy has nothing to do with the theory of irreversibility. If we consider 
for example the free expansion of a wavepacket in a big chaotic box, then we have S — at any moment. Still it is 
nice that the Von-Newman entropy of a canonical state coincides with the thermodynamic definition. 

Boltzmann entropy:— Boltzmann has defined S is a way that allows to discuss irreversibility. The idea was to 
divide the phase space of a system into small cells, and to characterize the statistical state in this representation 
using S. Then Boltzmann has proposed that S has the tendency to increase with time if the motion is chaotic (a 
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variation of this idea is the "H theorem" that refers to the reduced one-particle description of a gas particle). The 
same reasoning can be carried out in the quantum mechanical context where the division of Hilbcrt space into "cells" 
is defined by a complete set of projectors. Obviously in the latter context recurrences imply that the dynamics of a 
quantized closed chaotic system looks irreversible only for a limited time. 

State space:— The following visualization is useful. Any canonical state p of the system can be represented as a point 
in a (X, T) plane. We add a vertical axis for the energy. Hence in this extended representation the canonical states 
form a surface. We refer to this surface as the floor. Non-canonical states with the same entropy as the canonical 
state have a higher energy and accordingly are represented by points above the floor. In this space we can define 
adiabatic surfaces, that consist of states that have the same entropy. 

Thermodynamic process:— We visualize a thermodynamic process as a trajectory in the (X, T, E) space. A 
reversible quasi-static process is represented by a solid line that connects points A and B on the floor. An actual non- 
reversible process is represented by a dashed line that resides "above" the floor. Of particular interest are processes 
that start and end at the floor. According to Boltzmann the entropy during a process always increases. Loosely 
speaking this means that the probability at the end of the process is scattered on more " energy levels" . 

= [20.3] The thermodynamics version of the second law 



The Boltzmann entropy is a theoretical construct and hence the statistical version of the second law has no practical 
value. We have to "translate" both the definition of entropy and the "second law" into a thermodynamic language. 

Let us first discuss how define the "entropy" without relaying on Boltzmann. The first step is to characterize any 
thermal state by empirical temperature 9. This is well defined by the "zeroth law" of thermodynamics. The second 
step is to represent all the thermal states as points in a X — (X, 9) space. Now we can define "adiabatic surface" as 
the set of states that can be reached via a reversible adiabatic process that does not involve exchange of energy with 
the environment. We can label each surface by an number S(X), that we call "entropy" (with quotations marks). 
The convention would be that S[A] < S[B] if we can get from A to B via an irreversible process. 

If we have a reversible process that starts at point X, and ends at point X + dX the change in "entropy" is 
dS = V ' S ■ dX. At the same time we can write for the heat dQ = F ■ dX. By definition both dS — and dQ = 
define the same adiabatic surfaces. It follows that there is an "integration factor" such that F = T(X,9)'VS. We 
now postulate that there is a possibility to define S such that T is a function of 9 alone. This leads to the definition 
of "absolute temperature" and "thermodynamics entropy" without the use to relay on Boltzmann. 

Let us rephrase the thermodynamic postulate in a more illuminating way. Consider an isothermal process at temper- 
ature 9h that connects two adiabatic surfaces. Consider a second isothermal process at temperature 9c that connects 
the same surfaces. To say that dQ has an integration factor that depends on 9 alone means that the ratio dQn /dQc 
depends only on the temperatures 9h and 9c- Hence we can define "absolute temperature" using the definition of 
Carnot. This also fix the definition of S up to a constatnt. 

The connection with the canonical formalism is quite transparent. Using ideal gas thermometer we have identified 
9 = 1/(3, and then we showed th'&tdQ = ^2 r dp r E r has an integration factor T — 1/(3 as postulated in thermodynamics. 
During a reversible quasi-static process the change of the entropy of a system a is JdQ a /T a , while that of a second 
system is JctQ^/Tf,. If we have T a = Tf, = 9, it follows that the total entropy change has an additive property, hence 
entropy is an extensive quantity. 

= [20.4] The Clausius version of the second law 

The Boltzmann-Gibbs entropy is defined as S — — ^2 r PrAnp r , where p r is the probability to be in the rth cell in 
phase space. The Boltzmann version of the second law states that after any process from state "A" to state "B" 

guniverse^ _ guniverse^ > q ^q) 

In order to translate this microscopic formulation into the practical language of thermodynamics one assumes: (1) In 
the initial and final states the system is in equilibrium with bodies that have well defined temperatures Ta and 
Tg respectively; (2) During the process the system absorbs heat from equilibrated bodies that have well defined 
temperatures, while the system itself might be out of equilibrium; (3) The change in the entropy of an equilibrated 
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body that has a well defined temperature T is —dQ/T, where dQ is the heat transfer with the usual sign convention. 
With these assumptions we get the thermodynamic version of the second law: 

S ays [B]-S avs [A]] - f -^— > (821) 

J A ■'■baths 

In particular for a closed cycle we get the Clausius inequality 

Entropy production = — (t — > (822) 

J -'-baths 

Kelvin statement:— The immediate implication of the Clausius inequality is that there exist no process whose sole 
result is to transfer heat into work. If such process existed one would have dQ > at any stage of the cycle and hence 
the total entropy of the universe would decrease. Also the inverse statement is true: if it were possible to device a 
machine that violates Clausius inequality, then it would be possible to violate Kelvin's statement. The proof is based 
on the possibility to combine such a device with a Carnot engine. 

Direction of heat flow:— The next simplest application of the Clausius inequality is related to a cycle (AB)# in 
which the system is in contact with Ta, and later in contact with Tg (work is not involved). The result of such cycle 
is the transfer of an amount q of energy from Ta to Tg . Assuming Tg < Ta it follows from the Clausius inequality 
that q must be positive, which loosely speaking means that heat can flow only from the high to the low temperature. 

Maximum work principle:— Consider an isothermal process. We use the standard assumptions: the temperature 
of the bath is To, the initial state is equilibrium, and also at the end of the process we wait until an equilibrium is 
reached. Using the first law of thermodynamics (energy conservation) we can substitute Q = W + (E(B) — E(A)), 
where W is defined through this expression as the work done by the system. Using F(A) = E(A) — T S(A) and 
F(B) = E(B) - T S(A) we get from the second law 

Work done by the system = W < F(A) - F(B) (823) 

Obviously in order to extract "positive" work we need F(A) > F(B). The maximum work that can be extracted 
is equal to the free energy difference. A trivial implication is that if the universe included only one bath with one 
temperature, it would not be possible to extract work from a closed cycle. 

Irreversible work:— In practice, assuming that the state of the system is canonical-like at any instant of time, with 
a well define temperature T sys at any moment along the A t— > B process, we can calculate the change in the entropy 
of the system as an integral over dS sys — (dQ +dW iIICVCISiblo )/T sys , leading to 



B <?W,„ ovorslblo [ B f 1 1 



-*■ sys J A \ sys -*- baths 



dQ > (824) 



We see that the origin of reversibility is (i) irreversible work, e.g. frictional effects; (ii) temperature difference between 
the system and the bath during heat conduction. Additionally, based on the same expression for dS sys , we can write 

Y, d Pr E r = / T sys dS sys = Q + W lrrcvcralblo (825) 

r J A 

This is the change in the energy which is resulted from transitions between levels. These transitions are induced by 
the coupling to the environment and/or by the non-adiabaticity of the driving. It is complementary to the reversible 
work which is obtained from integrating dW = ydX along the An- B process. 
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[20.5] The Carnot Cycle paradigm 



A strict adiabatic process is a quasi-static process during which the system is totally isolated from the environment. 
For such process we have the adiabatic theorem. Namely, assuming that the motion is chaotic the system that has 
been prepared with definite energy E will remain on the the same adiabatic surface (classical version) or in the 
same energy level (quantum version) if a parameter X is being changed very slowly. If the system is prepared with 
probability p n in some energy shell (classical) or energy level (quantum) then this probability will not change during 
the process, and hence also the entropy will remain constant. In the classical version n is the phase space volume of 
the evolving energy surface, while in the quantum mechanical formulation it is the index that labels the energy levels. 
In the classical limit n € [0, oo], and the associated energy is denoted as E n (X). 

We can represent all the possible canonical states of a system as points in (X, T) space. The thermo-adiabatic lines 
connects points that have the same entropy. A thermo-adiabatic process is defined as a quasi-static process along a 
thermo-adiabatic line. We can think of such process as composed of many infinitesimal steps, where each step consists 
of a strict adiabatic process followed by a contact interaction with a bath that has the appropriate temperature. To 
see that the quesi-static limit exists, note the following: If a system is prepared in a canonical state E n (Xo), Then 
its energy after a strict adiabatic process is E* > E n (X) for any X away from Xq. For a small variation dX the 
energy difference can be expanded as dE* ex dX 2 . If after such a variation the system is connected to a bath that 
has the appropriate temperature, it would relax to a canonical state with the same entropy. This relaxation involves 
an entropy production dS = dE* jT . Integrating dS over the whole process we see that in the quesi-static limit the 
entropy production is zero. 

A strict Carnot cycle involves only two heat baths. The cycle (ABB*CDD*) is illustrated in the Figure. The initial 
preparation is canonical at A(Ti,Xa). The process from A(Ti,X^) to B*(Xg) is strictly adiabatic. At the end of 
this stage the obtained state is not canonical. The process from B*(Xb) to B(Xb,T 2 ) is the equilibration due to 
contact with a bath that has the temperature T 2 . It is an irreversible relaxation process in which the system goes to 
a lower energy with the same entropy. At the end of this process the obtained state is canonical. The process form 
B(Xb,T 2 ) to C(Xc,T 2 ) is quasi-static in contact with the same heat bath. The process from C(Xc,T 2 ) to D*(Xr>) 
is strictly adiabatic. The process from D*(Xr>) to D(Xd,Ti) and later back to A(T\,Xa) is in contact with the heat 
bath T x . 





T^ 



A 



thermo-adiabatic line 

D 
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[20.6] Fluctuations away from equilibrium 



It is customary to say that at equilibrium the expectation value of an observable reflects the typical value of this 
observable, while the fluctuations are relatively small. If the central limit theorem applies the RMS/mean should 
scale as 1/yN. However, we may insist to study the statistics of the tails. These might reveal interesting information 
about the underlying dynamics. 

If a system, that has a constant density of states, were isolated, then it would be natural to expect that the dis- 
tribution function of quantities like work and entropy production would be symmetric: P(—s) = P(s). But in the 
thermodynamic context it is more likely to have 

P{-s) = c _,3s P(s), [^-symmetric distribution] (826) 

It follows that P(s) can be written as a product of a symmetric function and an exponential factor e^ s ' 2 . Another 
consequence of the /3-symmetry is 

(c~ ,9s ) = 1, [convex average] (827) 

The latter equality can be re-phrased as follows: In analogy with the definition of harmonic average and geometric 
average that are defined as the inverse of ((1/s)) and as the exp of (log(s)) respectively, here we can define a convex 
average that is defined as the log of the above expression. The convex average is zero for a /3-symmctric distribution, 
while the standard algebraic average is positive 

(s) > 0, [convex inequality] (828) 

While for a symmetric distribution the average value (s) has to be zero, this is no longer true for a /3-symmctric 
distribution. Rather the average should be related to the variance. If we assume that s has Gaussian distribution 
with average value (s) — /i and variance Var(s) = cr 2 , then /3-symmetry requires 

(s) = -j3 Var(s), ["fluctuation dissipation" relation] (829) 

This is like a fluctuation dissipation relation. We can formalize this relation for non-Gaussian distribution in terms 
of comulant generating function g (A) which is defined through 

(e- As ) = e gW (830) 

Note that due to normalization g(0) = 0, while g'(0) = —[i and g"(0) = a 2 . In particular for a Gaussian 
g(X) = —jj,X + (1/2)ct 2 A 2 . For a symmetric distribution g(—\) = <?(A). But for /3-symmetry we must have 

g(/3 — A) = g(A), [characterization of /3-symmetric distribution] (831) 

Again we see that for a Gaussian /3-symmetry implies a relation between the mean and the variance. 

= [20.7] The Crooks relation and Jarzynski equality 

The Crooks relation and Jarzynski equality regards the probability distribution of work done during a non-equilibrium 
process that connects canonical states A and B. 

Let us have in mind a gas in cylinder with a movable piston. Initially the piston is in position A, and the gas in 
equilibrium with temperature Tq. The canonical probabilities are 



Pr(A) = -^f' 3EAA) - exp 



E r (A)-F(A) 



Tn 



(832) 
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Now wc displace the piston to position B doing work W. After that we can optionally allow the system to relax to 
the bath temperature To, but this no longer affects W. The distribution of work is defines as 



P{W) 



Y,Pr{A) 5(w - (E r (B) - E r (A))) 



(833) 



It is implicit here that we assume a conservative system with a well-defined invariant measure that allows division of 
phase space into "cells". The states E r (B) are associated with E r (A) through the dynamics in a one-to-one manner. 
Note that if the dynamics is non-adiabatic the order of the cells in energy space is likely to be scrambled. It follows 
from the above definition that 



cxp 



W 
~T 



1 



Z(A) 



^ e -(l/T ) Er (A) exp 



E r (B)-E r (A) 
Tn 



cxp 



F(B) F(A) 

Tn 



(834) 



This is the Jarzynski equality. It can be regarded as a refinement of the inequality W > (F(B) — F(A)), and it 
reduces to W = (F(B) — F(A)) in the the case of a quasi-static adiabatic process. 

Crooks relation.— The Jarzynski equality can be regarded as the outcome of the Crooks relation. We can define 
a probability distribution P rev (W) for a process that starts with a canonical state B, in complete analogy with the 
definition of P(W). Now the probabilities p r (A) are replaced by p r (B). It follows that 



Prev(-W) 



exp 



W - (F(B) - F(A)) 



Tn 



P{W) 



(835) 



The Jarzynski equality follows if we integrate both sides over W. In the standard references the phrasing of the 
Crooks relation is a somewhat more involved. One defines a joint probability distribution 



P(E b ,E a ) = Yv r (A)8[E a -E r {A))5[E b -E r {B) 



(836) 



and the statement becomes 



P(E + W,E) 

P rev (E,E + W) 



cxp 



W - (F(B) - F(A)) 



(837) 



The visualization of this inequality is as follows: Define the source as a shell of thickness dE a around E a — E, and 
the destination as a shell with thickness dE b around E b — E + W. Denote by N ex dE a dE b the number of cells that 
are connected by the dynamics. The probability to observe a process from E a to E b is Np r (A). The probability to 
observe an analogous reversed process is Np r (B). The ratio is p r (A)/p r (B), leading to the Crooks relation. Finally 
we note that in the quantum context the object that corresponds to P{W) is 



Pa-, 



(Lu) = J2Pn(A) \(m(B)\U(t)\n(A))\ 2 s(u - (E m (B) - E n (A))) 



(838) 



where U(t) is the evolution operator. The Crooks relation in this context can be regarded as a straightforward 
variation on the derivation of the "detailed balance relation" that applies to a spectral functions S(uj). 

The FD relation.— Let us see what is the implication on the Crooks relation with regard to a simple closed cycle 
for which F(B) = F(A). In such case P(W) is a /3-symmetric distribution. It follows that there is a "fluctuation 
dissipation relation" 



(W) = ^Var(W) 



(839) 
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Considering a multi-cycle process Var(W) = 2£>e£ and (W) = Wt, leading to the dissipation-diffusion relation that 
we have derived in past lecture VV = (1/T)De, from which follows the dissipation-fluctuation relation 77 = v/(2T). 



[20.8] The nonequilibrium fluctuation theorem 



The nonequilibrium fluctuation theorem (Evans, Cohen, Morris, Searles, Gallavotti) regards the probability distribu- 
tion of the entropy production during a general non-equilibrium process. 

The clearest formulation of this idea assumes that the dynamics is described by a rate equation. The transition rates 
between state n and state m satisfies 



w(n\m) 
w{m\n) 



exp 



T 

-*- 71:111. 



(840) 



Where T nm is the temperature that controls the nm transition. We can regard the rate equation as describing 
a random walk process. Consider a trajectory x(t). If the particle makes a transition from m to n the entropy 
production is (E m — E n )/T nm . Accordingly 



P[x(-t)] 



exp 



AS[x] 



(841) 



As an example for the practicality of this relation consider the current fluctuations in a ring. The fluctuation theorem 
implies that the ratio P(I)/P(—I) at non-equilibrium steady state should equal the above Boltzmann factor, where 
S[x] is the entropy production due to a particle that encircles the ring. 

Heat conduction.— A prototype application of the above relation is to the analysis of heat flow form hot bath Th 
to cold bath Tc- The temperature difference is e = Tjj — Tc- We assume that the conductor that connects the two 
baths can be modeled using a master equation. The transition between states of the conductor arc induced by the 
bath and are like a random walk. With any trajectory we can associate quantities Qh and Qc that represent that 
heat flow from the baths into the conductor. From the fluctuation theorem it follows that 



P(Qh,Qc) 
P(-Qh,-Qc) 



exp 






T h 



(842) 



Next we define the absorbed energy Q — Qh + Qc and the heat flow Q = (Qh 
time limit Q ~ t while the fluctuations of Q are bounded. Accordingly we get 



Qc)/2- We realize that in the long 



P(Q) 
P(-Q) 



exp 



1 



1 



T c T H 



Q 



(843) 



If wc use a Gaussian approximation, we get a "fluctuation-dissipation" relation 



(Q) = U^-^nQ) 



(844) 



The relation can be linearized with respect to e = Tjj — Tc- The thermal conductance is defined through (Q) = Ke x t, 
and the intensity of fluctuations through Var(Q) = v x t. Thus we deduce that 



(Q) = Kx(Th-T c ), 



with K = ^^v 



(845) 
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